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© Tumor necrosis factor receptor proteins, DNAs and expression vectors encoding TNF receptors, and 
processes for producing TNF receptors as products of recombinant cell culture, are disclosed. 
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TUMOR NECROSIS FACTOR-a AND RECEPTORS 



BACKGROUND OF THE INVEfTTION 



The present invention relates generally to cytokine receptors and more spedfically to tumor neaosis 
5 factor receptors. 

Tumor necrosis factor-a (TNFa, also known as cachectin) and tumor necrosis factor-^ (TNF^. also 
known as lymphotoxin) are homologous mammalian endogenous secretory proteins capable of inducing a 
wide variety of effects on a large number of cell types. The great similarities in the structural and functional 
characteristics of these two cytokines have resulted in their collective description as "TNF." Complementary 

10 cDNA clones encoding TNFa (Pennica et al„ Nature 312:724, 1984) and TNF^ (Gray et al.. Nature 
3/^721. 1984) have been isolated, pemiitting further structural and biological characterization of TNF. 

TNF proteins initiate their biologicaJ effect on cells by binding to specific TNF receptor (TNF-R) proteins 
expressed on the plasma membrane of a TNF-responsive cell. TNFa and TNFfl were first shown to bind to 
a common receptor on the human cervical carcinoma cell line ME-180 (Aggarwal et al.. Nature 

15 37ft665,1985). Estimates of the size of the TNF-R determined by affinity labeling studies ranged from 54 to 
175 kOa (Creasey et al. Proc NatL Acad. Sci. USA 54:3293, 1987; Stauber et al., J. Biol, Chem. 
263:19098. 1988; Hohmann et al.. J. fi/oA Chem. 264:14827. 1989). Although the relationship between 
these TNF-Rs of different molecular mass Is unclear, Hohmann et al, (7. Biol. Chem, 264:14827, 1989) 
reported that at least two different cell surface receptors for TNF exist on different cells kOa, respectively. 

20 None of the above publications, however, reported the purification to homogeneity of cell suriace TNF 
receptors. 

In addition to cell surface receptors for TNF, soluble proteins from human urine capable of binding TNF 
have also been identified (Peetre et al.. Eur. J. HaematoL 47:414, 1988; Seckinger et al., J. Exp. Med. 
/67:1511. 1988: Seckinger et al., J. BloL^gnfm. 264:11968, 1988; UK Patent Application, Publ. Nq.2 218 

25 101 A to Seckinger et al.; Engelmann et al7, J. Biol. Chem. rW:11974, 1989). The soluble urinary TNf" 
binding protein disclosed by UK 2 218 101 A has a partial N-terminal amino acid sequence of Asp*Ser-Vai- 
Cys-Pro-. which conresponds to the partial sequence disck^sed later by Engelmann et al. (1989). The 
relationship of tiie above soluble urinary binding proteins was further elucidated after original parent 
application (U.S. Serial No. 403241) of the present application was filed, when Engelmann et al. reported 

30 tine identification and purification of a second distinct jgiuble urinary TNF binding protein having an N- 
lerminai amino add sequence of Val-Ala-Phe-Thr-Pro- {J. Biol. Chem, 265:1531, 1990). The two urinary 
proteins disclosed by the UK 2 218 101 A and the Engelmann et al. publications were shown to be 
immunochemlcally related to two apparentiy distinct cell surface proteins by the ability of antiserum against 
the binding proteins to inhibit TNF binding to certain cells. 

35 More recently, two separata groups reported the molecular cloning and expression of a human 55 kDa 
TNF-R (Loetscher et al. Cell 6/:351. 1990; Schall et al.. Ceil 6/:361. 1990). The TNF-R of both groups has 
an N-terminal amino add sequence which corresponds to the partial amino acid sequence of the urinary 
binding protein disclosed by UK 2 218 101 A, Engelmann et al. (1989) and Englelmann et al. (1990). 

In order to elucidate the relationship of the multiple forms of TNF*R and soluble urinary TNF binding 

40 proteins, or to study the structmal and biological characteristics of TNF-Rs and the role played by TNF-Rs 
in the responses of various cell populations to TNF or other cytokine stimulation, or to use TNF-Rs 
effectively In therapy, diagnosis, or assay, purified compositions of TNFR are needed. Such compositions, 
however, are obtainable in practical yiekis only by ctoning and expressing genes encoding the receptors 
using recombinant ONA technotogy. Efforst to purify the TNFR molecule for use in biochemical analysis or 

45 to ck)ne and express mammalian genes encoding TNF-R. however, have been impeded by lack of a 
suitable source of receptor protein or mRNA. Prior to the present invention, no celt lines were known to 
express high levels of TNF-R constttutivety and continuously, which precluded purification of receptor for 
sequencing or construction of genetic libraries for cONA ck)ning. 

so 

SUMMARY OF THE INVENTION 



The present invention provides Isolated TNF receptors and ONA sequences encoding mammafian 
tumor necrosis fisctor receptors (TNF^). In particular, human TNF-Rs. Such ONA sequences include (a) 
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cDNA clones having a nucleotide sequence derived from the coding region of a native TNF-R gene; (b) 
DNA sequences which are capable of hybridization to the cONA clones of (a) under moderately stringent 
conditions and which encode biologically active TNF-R molecules; or (c) DNA sequences which are 
degenerate as a result of the genetic code to the DNA sequences defined in (a) and (b) and which encode 

5 biologically active TNF-R molecules, in particular, the present invention provides DNA sequences which 
encode soluble TNF receptors. 

The present invention also provides recombinant expression vectors comprising the DNA sequences 
defined aiMve* recombinant TNF-R molecules produced using the recombinant expression vectors, and 
processes for producing the recombinant TNF-R molecules using the expression vectors. 

10 The present Invention also provides Isolated or purified protein compositions comprising TNF-R, and. in 
particular, soluble forms of TNF-R. 

The present invention also provides compositions for use in therapy, diagnosis, assay of TNF-R. or in 
raising antibodies to TNF-R, comprising effective quantities of soluble native or recombinant receptor 
proteins prepared according to the foregoing processes. 

75 Because of the ability of TNF to specifically bind TNF receptors (TNF-Rs), purified TNF-R compositions 
will t>e useful in diagnostic assays for TNF. as well as In raising antibodies to TNF receptor for use in 
diagnosis and therapy. In addition, purified TNF receptor compositions may be used directly in therapy to 
bind or scavenge TNF. thereby providing a means for regulating the immune activities of this cytoidne. 
These and other aspects of the present invention will t)ecome evident upon reference to the following 

20 detailed description. 

BRIEF DESCRIPTION OF THE DRAWiNQS 

25 

Rgure 1 is a schematic representation of the coding region of various cDNAs encoding human and 
murine TNF-Rs. The leader sequence Is hatched and ttie transmembrane region is solid. 
Rgure 2A-2B depict the partial cONA so^nce. and derived^ ^ino add sequence of tiie human TNF-R 
clone 1. Nucleotides are numbered from the beginning of &m^S untranslated region. Amino acids are 

30 numbered from the beginning of the signal peptide sequence. The putative signal peptide sequence is 
represented by the amino acids -22 to -1. The N-termlnai leucine of the mature TNF-R protein Is 
underiined at position 1. The predicted transmembrane region from amino acids 236 to 265 is also 
underiined. The C-termini of various soluble TNF-Rs are mari<ed with an an-ow (t). 
Rgure 3A-3C depict the cDNA sequence and derive<l-«mino acid sequence of murine TNF-R clone 11. 

35 The putative signal peptide sequence is represented by amino acids -22 to -1. 1. The N-terminal valine 
of the mature TNF-R protein is underiined at position 1. The predicted transmembrane region from 
amino acids 234 to 265 is also underiined. 

DETAILED DESCRiPTlON OF THE INVEKTIOW 

40 



Definitions 

45 As used herein, the terms "TNF receptor" and TNF-R" refer to proteins having amino acid sequences 
which are substantially similar to the native mammalian TNF receptor amino add sequences, and which are 
biologicaily active, as defined betow, in tiiat tiiey are capable of binding TNF molecules or transdudng a 
biological signal Initiated by a TNF molecule binding to a cell, or cross-reacting with anti-TNF-R antibodies 
raised against TNFR from natural (I.e., nonrecombinant) sources. The mature fulWength human TNF-R is a 

90 glycoprotein having a molecular weight of about 80 Wlodaltons (kDa). As used tivoughout tite specification, 
the term "mature" means a protein expressed In a form lacking a leader sequence as may be present In 
full-length transcripts of a native gene. Experiments using COS cells transfected with a cDNA encoding full- 
lengtti human TNF-R show.ed that TNF-R bound ^»l-TNFa witii an apparent K. of about 5 x 10» M-\ and 
that TNF-R bound ^^hJUffi with an apparent K, of about 2x10* M"'. The terms "TNF receptor" or "TNF- 

66 R* indude, but are not Dmited to, analogs or subunits of native proteins having at least 20 amino adds and 
which exhibit at least some biological activity in common wttti TNFfl, for example, soluble TNF-R 
constructs which are devoid of a transmembrane region (and are secreted from ttie cell) but retain the 
ability to bind TNF. Various bloequivaJent protein and amino add analogs are described in detail below. 
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The nomenclature for TNF-R analogs as used herein follows the convention of naming the protein (e,g., 
TNF*R) preceded by either hu (for human) or mu (for murine) and followed by a A (to designate a deletion) 
and the number of the C-terminaJ amino add. For example. huTNF-RA235 refers to human TNF-R having 
Asp^ as the C-tenminaJ amino acid (i.e.. a polypeptide having the sequence of amino acids 1-235 of Rgure 

5 2A)* In the absence of any human or murine species designation. TNF-R refers generically to mammalian 
TNF-R. Similarly, in the absence of any specific designation for deletion mutants, the term TNF*R means all 
forms of TNF-R, Including mutants and analogs which possess TNF-R biological activity. 

■Soluble TNF-R" or "sTNF-R* as used in the context of the present Invention refer to proteins, or 
substantially equivalent analogs, having an amino add sequence corresponding to all or part of the 

10 extracellular region of a native TNF-R. for example. huTNF-RA235. huTNF-RA185 and huTNF-RAie3, or 
amino add sequences substantially similar to tiie sequences of amino acids 1-163. amino acids 1-185. or 
amino adds 1-235 of Rgure 2A, and which are biologically active In ttiat they bind to TNF ligand. 
Equivalent soluble TNF-Rs indude polypeptides which vary from titese sequences by one or more 
substitutions, deletions, or additions, and which retain ti^ ability to bind TNF or inhibit TNF signal 

15 transduction activity via cell surface bound TNF receptor proteins, for example huTNF-RAx. wherein x is 
selected from the group consisting of any one of amino adds 163-235 of Rgure 2A. Analogous deletions 
may be made to muTNF-R. Inhibition of TNF signal transduction activity can be determined by transfecting 
cells with recombinant TNF-R DNAs to obtain recombinant receptor expression. The cells are then 
contacted with TNF and tiie resulting metabolic effects examined. If an effect results which is attributable to 

20 tiie action of the ligand. then the recombinant receptor has signal transduction activity. Exemplary 
procedures for determining whether a polypeptide has signal transduction activity are disclosed by Idzerda 
et al.. J. Exp. MeA 171:66^ (1990); Curtis et ai.. Froc Natl. Acad. ScL (/SA 85^3045 (1989): Prywes et al.. 
EMBO J. 5-.217g(19a8) and Chou et ai., J. Biol. Chem. 262:1842 (1987). Alternatively, primary cells or cell 
lines which express an endogenous TNF receptor and have a detectable biological response to TNF could 

25 also be utilized. 

The term "isolated" or "purified", as used in the context of this spedfication to define the purity of 
TNF-R protein or protein compositions, means that the protein or protein composition is substantially free of 
other proteins of natural or endogenocrar origin and cqntainsjess than about 1% by mass or pr otein 
contaminants residual of production processes. Such compositions, however, can contain other proteins 

30 added as stabilizers, caniers. exdpients or co-tiierapeutics. TNF-R is isolated if it is detectable as a single 
protein t)and In a polyacrylamide gel by silver staining. 

The term "substantially similar." when used to define eititer amino add or nucleic acid sequences, 
means that a particular subject sequence, for example, a mutant sequence, varies from a reference 
sequence by one or more substitutions, deletions.- or adMons. the net effect of which is to retain biological 

35 activity of the TNF-R protein as may be detennined. for example, in one of the TNF-R binding assays set 
forth Jn Example 1 below. Alternatively, nucleic add sut>units and analogs are "substantially similar" to the 
specific DNA sequences disclosed herein if: (a) the DNA sequence is derived from the coding region of a 
native mammalian TNFR gene; (b) tiie DNA sequence is capable of hybridization to DNA sequences of (a) 
under moderately stringent conditions (50 *C, 2x SSC) and which encode biologically active TNF-R 

40 molecules: or DNA sequences which are degenerate as a result of the genetic code to the DNA sequences 
defined in (a) or (b) and which encode biotogically active TNF-R molecules. 

"Recombinant" as used herein, means ttiat a protein is derived from recombinant (e.g.. microbial or 
mammalian) expression systems. "Microbial' refers to recombinant proteins made in bacterial or fungal 
(e.g^ yeast) expression systems. As a product 'recombinant microbial" defines a protein produced in a 

45 microbial expression system which Is essentially free of native endogenous substances. Protein expressed 
in most bacterial cultures, e.g^ £ coll, will be free of glycan. Protein expressed in yeast may have a 
gtycosylation pattern different from that expressed In mammalian ceils. 

"Bioiogfcalfy actWe.* as used ttiroughout ttie spedfication as a characteristic of TNF receptors, means 
that a particular molecule shares suffident amino add sequence similarity with the embodiments of the 

50 present Invention disclosed herein to be capable of binding detectable quantities of TNF, transmitting a TNF 
stimulus to a cell, for example, as a component of a hybrid receptor construct or cross-reacting with anti- 
TNF-R antibodies r^sed against TNFR from natural (l On nonrecombinant) sources. Preferably, blotogtcally 
active TNF receptors within .the scope of the present invention are capable of binding greater than 0.1 
nmoles TNF per nmole receptor, and most preferably, greater ttian QJS nmole TNF per nmole receptor In 

55 standard binding assays j(see below). 

"Isolated DNA sequence" refers to a DHA polymer, in \he form of a separate fragment or as a 
component of a larger DNA construct which has been derived from DNA isolated at least once (n 
substantially pure form, le^ free of contaminating endogenous materials and in a quantity or concentratkKi 
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enabling identification, manipulation, and recovery of the sequence and its component nucleotide se- 
quences by standard biochemical methods, for example, using a cloning vector. Such sequences are 
preferably provided in the fonm of an open reading frame uninterrupted by intemai nontranslated se- 
quences, or introns. which are typically present in eukaryotic genes. Genomic ONA containing the relevant 

5 sequences could also be used as a source of coding sequences. Sequences of non-translated DNA may be 
present s' or 3' from the open reading frame, where the same do not interfere with manipulation or 
expression of the coding regions. 

"Nucleotide sequence* refers to a heteropctymer of deoxyribonucleotides. ONA sequences encoding 
the proteins provided by this invention can be assembled from cOf^ fragments and short oligonucleotide 

70 linkers, or from a series of oligonucleotides, to provide a synthetic gene which is capable of being 
expressed in a recombinant transcriptional unit 



Isolation of cDNAs Encoding TNF-R 

16 

The coding sequence of TNF-R is obtained by isolating a complementary DNA (cONA) sequence 
encoding TNF-R from a recombinant cONA or genomic DNA library. A cONA library Is preferably 
constructed by obtaining polyadenylated mRNA from a particular cell line which expresses a mammalian 
TNF-R, for example, the human fibroblast cell line Wl-2e VM (ATCC CCL 95.1) and using the mRNA as a 

20 template for synthesizing double stranded cDNA. The double strarxled cDNA is then packaged into a 
recombinant vector, which Is Introduced Into an appropriate £ coli strain and propagated. Murine or other 
mammalian cell lines which express TNF-R may also be used. TNF-R sequences contained in the cDNA 
library can be readily identified by screening the library wHh an appropriate nucleic acid probe which is 
capable of hybridizing with TNF-R cDNA. Altematively. DNAs encoding TNF-R proteins can be assembled' 

25 . by ligation of synthetic oligonucleotide subunits corresponding to all or part of the sequence of Figures 2A- 
2B or 3A-3C to provide a complete coding sequence. 

The human TNF receptor cDNAs of tiie present Invention were isolated by ttie method of direct 
expression cloning. A cONA library was con^ructed by first isolating cytoplasmic mRNA from the human 
fibroblast cell line WI-26 VA4. Polyadenylated RNA was IsolatfiS and used to prepare double-stranded^ 

30 cDNA. Purified cONA fragments were then ligated Into pCAV/NOT vector DNA which uses regulatory 
sequences derived from pDC201 (a derivative of pMLSV. previously described by Cosman et al.. Nature 
3/2:768, 1984). SV40 and cytomegalovirus DNA. described In detail below in Example 2. pCAV/NOT has 
been deposited with the American Type Culture Collection under accession No. ATCC 68014. The 
pCAV/NOT vectors containing the W126-VA4 cDNA.fraggiMts were transformed Into £. coU strain DHSa. 

39 Transformants were plated to provide approximately 800 colonies per plate. The resulting colonies were 
harvested and each pool used to prepare plasmid DNA for transfection into COS-7 cells essentially as 
described by Cosman et al. {Nature 3/2:768. 1984) and Lutiiman et al. {NucL Acid Res, r/:l295. 1983). 
Transfonmants expressing biologically active cell surface TNF receptors were identified by screening for 
their ability to bind *^I-TNF. In tills screening approach, transfected COS-7 cells were Incubated with 

40 medium containing ^^hTNF, the cells washed to remove unbound labeled TNF, and the cell monolayers 
contacted X-ray film lo detect concentrations of TNF binding; as disclosed by Sims et al. Science 
241:565 (1988). Transfectants detected in tills manner appear as dark fod against a relatively light 
background. 

Using this approach, approximately 240.000 cDNAs were screened in pools of approximately 800 
45 cDNAs until assay of one transfectant pool indicated positive fod for TNF binding. A frozen stock of 
bacteria from tills positive pool was grown in cutture and plated to provide individual colonies, which were 
screened until a single done (clone 11) was Identified which was capable of directing synthesis of a surface 
protein with detectable TNF binding activity. The sequence of cDNA clone 11 Isolated by the above method 
is depicted in Hgures 3A-3C. 
60 Additional cONA dones can be isolated from cDNA libraries of ottier mammaOan spedes by cross- 
spedes hybridization. For use in hybridization, ONA encoding TNFR may be covalentiy labeled with a 
detectable substance such as a fluorescent group, a radioactive atom or a chemiluminescent group by 
methods well known to titose skilled In the art Such probes could also be used for in vitro diagnosis of 
particular conditions. 

55 Like most mammalian genes, mammalian TNF receptors are presumably encoded by multi-exon genes. 
Alternative mRNA constructs which can be attributed to different mRNA spHdng events following transcrip- 
tion, aruj which share large regions of Identity or similarity with tt)e cONAs claimed herein, are considered 
to be within the scope of the present invention. 
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Other mammalian TNF-R cDNAs are Isolated by using an appropriate human TNF-R DNA sequence as 
a probe for screening a particular mammalian cONA library by cross-species hybridization. 



5 Proteins and Analogs 

The present invention provides Isolated recombinant mammalian TNF-R polypeptides. Isolated TNF-R 
polypeptides of this invention are substantially free of other contaminating materials of natural or endog- 
enous origin and contain less than about 1% by mass of protein contaminants residual of production 

10 processes. The native human TNF-R molecules are recovered from cell lysates as glycoproteins having an 
apparent molecular weight by SOS-PAGE of about 80 Idlodaftons (kOa). The TNF-R polypeptides of this 
invention are optionally without associated native-pattern glycosylation. 

Mammalian TNF-R of the present invention includes, by way of example, primate, human, marine, 
canine, feline, bovine, ovine, equine and porcine TNF-R. Mammalian TNF-Rs can be obtained by cross 

75 species hybridization, using a single stranded cONA derived from the human TNF-R ONA sequence as a 
hybridization probe to isolate TNF-R cONAs from mammalian cOf^ libraries. 

Derivatives of TNF-R within the scope of the invention also include various structural fomns of the 
primary protein which retain biological activity. CXie to the presence of ionlzable amino and carboxyl groups, 
for e}cample. a TNFR protein may be In the form of acidic or basic salts, or may be in neutral form. 

so Individual amino add residues may also be modified by oxidation or reduction. 

The primary amino add structure may be modified by forming covalent or aggregative conjugates with 
otiier chemical moieties, such as glycosyt groups, lipids, phosphate, acetyl groups and the like, or by 
creating amino add sequence mutants. Covalent derivatives are prepared by linking particular functional 
groups to TNF-R amino add side chains or at the N-or C-termini. Other derivatives of TNF-R within tiie 

25 scope of this invention include covalent or aggregative conjugates of TNF-R or its fragments with other 
proteins or polypeptides, such as by synthesis in recombinant culture as N-terminal or Oterminal fusions. 
For example, the conjugated peptide may be a a signal (or leader) polypeptide sequence at the N-terminal 
region of the protein which co-translatiorMfly or post-translationally directs transfer of the protein fro m- its 
site of synthesis to its site of function inside or outside of tfte celTmembrane or wall (e.g., the yeast a-factOr 

30 leader). TNF-R protein fusions can comprise peptides added to fadlitate purification or identification of TNF- 
R (e.g., poly-His). The amino add sequence of TNF receptor can also be linked to the peptide Asp-Tyr-Lys- 
Asp- Asp-Asp-Asp-Lys (DYKDDOOK) (Hopp et al.. Bio/Technology 6;i 204,1 98a) The latter sequence is 
highly antigenic and provides an epitope reversibly tx>und by a specific monoclonal antibody, enabling 
rapid assay and fadte purification of expressed reconM^ant protein. This sequence is also specifically 

35 cleaved by bovine mucosal enterokinase at the residue immediately following the Asp-Lys pairing. Fusion 
proteins capped with this peptide may also be resistant to intracellular degradation in £ coll 

TNF-R derivatives may also be used as immunogens. reagents in receptor-t>ased immunoassays, or as 
binding agents for affinity purification procedures of TNF or otiner binding ligands. TNF-R derivatives may 
also be obtained by cross-linking agents, such as M^aleimidobenzoyl sucdnimide ester and N-hydrox* 

40 ysucdnimide, at cysteine and lysine residues. TNF-R proteins may also be covalently bound through 
reactive side groups to various insoluble substrates, such as cyanogen bromide-activated, bisoxirane- 
activated, cart)onykiIimidazole-activated or tosyl-actfvated agarose structures, or by adsorbing to polyolefin 
surfaces (with or without glutaraldehyde cross-tlnking). Once bound to a substrate, TNF-R may be used to 
selectively bind (for purposes of assay or purification) anti-TNFR antibodies or TNF. 

45 The present Invention also Indudes TNF-R with or without assodated native-pattern glycosylation. TNF- 
R expressed in yeast or mammaOan expression systems, e.g., COS-7 cells, may be similar or slightiy 
different in nrolecular weight ai%J glycosylation pattern than the native molecules, depending upon the 
expression system. Expression of TNF-R ONAs in bacteria such as £ coil provides non-glycosylated 
molecules. Functional mutant analogs of mammalian TNF-R having inactivated N^lycosylation sites can be 

60 produced by oligonucleotide synthesis and ligation or by stte-spectfic mutagenesis techniques. These 
anak)g proteins can be produced in a homogeneous, reduced-cartx)hydrate form in good yiekl using yeast 
expression systems. N-glycosytation sites in eukaryotic proteins are characterized by the amino add triplet 
Asn-AfZ, where At Is any amino add except Pro. and Z is Ser or Thr. In this sequence, asparagine 
provides a side diain amino group for covalent attachment of carbohydrate. Such a site can be eliminated 

55 by sut>strtuting another amino add for Asn or for residue Z, deleting A^ or Z, or inserting a nor)-Z amino 
add between Ai and Z. or an amino add other than Asn between Aot and At. 

TNF-R derivatives may also be obtained by mutations of TNF-R or its subunlts. A TNF-R mutant as 
referred to herein, is a polypeptide homok)gous to TNF-R but which has an amino add sequence different 
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from native TNF-R because of a deletion, insertion or substitution. 

Bioequivalent analogs of TNF-R proteins may be constructed by. for example, making various 
substitutions of residues or sequences or deleting terminal or intemai residues or sequences not needed for 
biological activity. For example, cysteine residues can be deleted (e.g., Cys*'*) or replaced with other 

5 amino acids to prevent formation of unnecessary or incorrect intramolecular disulfide bridges upon 
renaturation. Other approaches to mutagenesis involve modification of adjacent dibasic amino add residues 
to enhance expression in yeast systems in which KEX2 protease activity is present. Generally, substitutions 
should be made conservatively; i.e., tiie most preferred substitute amino acids are those having 
physiochemical characteristics resembling those of the residue to be replaced. Similarly, when a deletion or 

TO Insertion strategy is adopted, the potential effect of tiie deletion or insertion on biological activity should be 
considered. Substantially similar polypeptide sequences, as defined above, generally comprise a like 
number of amino acids sequences, although C-terminal truncations for the purpose of constructing soluble 
TNF-Rs will contain fewer amino acid sequences. In order to preserve tiie biological activity of TNF*Rs. 
deletions and substitutions will preferably result in homologous or conservatively substituted sequences. 

15 meaning tiiat a given residue is replaced by a biologically similar residue. Examples of conservative 
substitutions include substitution of one aliphatic residue for another, such as He. Val. Leu. or Ala for one 
another, or substitutions of one polar residue for another, such as t)etween Lys and Arg; Qlu and Asp; or 
Gin and Asn. Other such conservative substitutions, for example, substitutions of entire regions having 
similar hydrophobicity characteristics, are well known. Moreover, particular amino add differences between 

20 human, murine and other mammalian TNF-Rs is suggestive of additional conservative substitutions that 
may be made witiiout altering the essential biological characteristics of TNF-R. 

Subunits of TNF-R may be constructed by deleting tenminal or intemai residues or sequences. 
Particularly preferred sequences include those in which the transmembrane region and intracellular domain 
of TNF-R are deleted or substituted with hydrophlllc residues to fadlitate secretion of the receptor Into the"^' 

25 cell culture medium. The resulting protein is refenBd to as a soluble TNFR molecule which retains its 
ability to bind TNF. A particulariy preferred soluble TNF-R construct is TNF-RA235 (the sequence of amtno 
acids 1-235 of Rgure 2A), which comprises the entire extracellular region of TNF-R« terminating with Asp^ 
immediately adjacent the transmembrane regjpn. Additional^ amjno adds may be deleted from the trans* 
membrane region while retaining TNF binding activity. For'exarr^e. huTNF-RA183 which comprises tfiar* — 

30 sequence of amino adds 1-183 of Figure 2A. and TNF-RA163 which comprises tine sequence of amino 
adds 1-163 of Figure 2A. retain the ability to bind TNF ligand as determined using the binding assays 
described below in Example 1. TNF-RA142. however, does not retain the ability to bind TNF ligand. This 
suggests tiiat one or both of Cys*'^ and Cys^^^ Is required for fonmation of an intramolecular disulfide 
bridge for tiie proper folding of TNF-R. Cys^^', which wa&^leted witiiout any apparent adverse effect on 

35 the ability of the soluble TNF-R to bind TNF, does not appear to be essential for proper folding of TNF-R. 
Thus, any deletion C-terminal to Cys**' woukj be expected to result in a blotogicalty active soluble TNF-R. 
The present invention contemplates such soluble TNF-R constructs corresponding to all or part of tiie 
extracellular region of TNF-R terminating with any amino add after Cys^*^. Ottrer C-terminal deletions, such 
as TNF-FA157. may be made as a matter of convenience by cutting TNF-R cDNA with appropriate 

40 restriction enzymes and. if necessary, reconstructing specific sequences witti synthetic oligonucleotide 
linkers. The resulting soluble TNF-R constructs are then Inserted and expressed in appropriate expression 
vectors and assayed for the ability to bind TNF. as described in Example 1. Biologically active soluble TNF- 
Rs resulting from such constructions are also contemplated to be witiiln the scope of the present invention. 
Mutations In nucleotide sequences constructed for expression of analog TNF-R must, of course, 

45 present tire reading frame phase of the coding sequences and preferably will not create complementary 
regions that could hybridize to produce secondary mRNA strvctures such as k)ops or hairpins which would 
adversely affect translation of the receptor mRNA. Although a mutation site may be predetermined, it is not 
necessary titat the nature of the mutation per se be predetermined. For example, in order to select for 
optimum characteristics of mutants at a given site, random mutagenesis may be conducted at the target 

so codon and the expressed TNF-R mutants screened for the desired activity. 

Not all mutations in the nucleotide sequence which encodes TNF-R will be expressed In ttie final 
product for example, nucleotide substitutions may be made to enhance expression, primarily to avdd 
secondary structure k)ops In the transcribed mRNA (see EPA 75.444A, Incorporated herein by reference), 
or to provide codons that are more readily translated by tiie selected host e.g.. the well-known £. coil 

60 preferBncecodonsforf. CO// expression. 

Mutations can be introduced at particular k)d by synthesizing oQgonudeotides containing a mutant 
sequence, flanked by restriction sites enabOng Ogation to fragments of the native sequence. Following 
ligation, tiie resulting reconstructed sequence encodes an analog having the desired amino add insertion. 
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substitution, or deletion. 

Alternatively, oligonucleotide^irected srte*specific mutagenesie procedures can be employed to provide 
an altered gene having particular codons altered according to the substitution, deletion, or insertion 
required. Exemplary methods of making the alterations set forth above are disclosed by Walder et al. (Gene 

5 42:133, 1986): Bauer et al. {Gene 37:73, 1985); Craik (BtoTechnlques, January 1985, 12-19): Smith et al. 
(Genetic Engineering: Principles end Methods, Plenum Press, 1981); and U.S. Patent Nos, 4,518.584 and 
4,737.462 disclose suitable techniques, and are incorporated by reference herein. 

Both monovalent forms and polyvalent forms of TNF-R are useful In the compositions and methods of 
this invention. Polyvalent forms possess multiple TNF«R binding sites for TNF ligand. For example, a 

70 bivalent soluble TNF-R may consist of two tandem repeats of amino acids 1-235 of Rgure 2A. separated by 
a linker region. Alternate polyvalent forms may also be constructed, for example, by chemically coupling 
TNF-R to any clinically acceptable canier molecule, a polymer selected from the group consisting of Rcoll. 
polyethylene glycol or dextran using conventional coupling techniques. Alternatively. TNF-R may be 
chemically coupled to biotin. the biotin-TNF-R conjugate then^ allowed to bind to avidin, resulting in 

75 letravalent avidln/biotin/TNF-R molecules. TNF-R may also be covalently coupled to dinitrophenol (DNP) or 
trinitrophenol (TUP) and the resulting conjugate precipitated with anti-DNP or anti-TNP-IgM, to form 
decameric conjugates with a valency of 10 for TNF-R binding sites. 

A recombinant chimeric antibody molecule may also be produced having TNFR sequences substituted 
for the variable domains of either or both of the immunoglutMjIln molecule heavy and light chains and 

20 having unmodified constant region domains. For example, chimeric TNF-R/IgQi may be produced from two 
chimeric genes - a TNF-R/human « light chain chimera (TNF-R/CJ and a TNF-R/human 71 heavy chain 
chimera (TNF-R/C^i). Following transcription and translatton of the two chimeric genes, the gene products 
assemble into a single chimeric antibody molecule having TNF-R displayed bivafentiy. Such potyvaleot 
forms of TNFR may have enhanced binding affinity for TNF Rgand, Additional details relating to the 

25 construction of such chimeric antibody molecules are disclosed In WO 89A}9622 and EP 315062. 



Expression of Recombinant TNF-R r - _ ^ 

30 The present invention provides recombinant expression vectors to amplify or express ONA encoding 
TNF-R. Recombinant expression vectors are repUcable ONA constnjcts which have synthetic or cDNA- 
derived ONA fragments encoding mammalian TNF-R or bioequivalent analogs operably linked to suitable 
transcriptional or translational regulatory elements derived from mammalian, microbial, viral or insect genes. 
A transcriptional unit generally comprises an assembl^TTf (1) a genetic element or elements having a 

35 regulatory role in gene expression, for example, transcriptional promoters or enhancers, (2) a structural or 
coding sequence which is transcribed into mRNA and translated Into protein, and (3) appropriate transcrip- 
tion and translation Initiation and termination sequences, as described in detail below. Such regulatory 
elements may Include an operator sequence to control transcription, a sequence encoding suitable mRNA 
ribosomal binding sites. The ability to replicate in a host, usually conferred by an origin of replication, and a 

40 selection gene to facilitate recognition of transformants may additionally be incorporated. DNA regions are 
operably linked when they are functionally related to each other. For example, DNA for a signal peptide 
(secretory leader) is operably linked to DNA fbr a polypeptide if it is expressed as a precursor which 
participates In the secretkxi of the polypeptide; a promoter is operably linked to a coding sequence if it 
controls the transcription of the sequence; or a ribosome binding site is operably linked to a coding 

45 sequence if it is positfoned so as to peontt translation. Qeneralty, operably linked means contiguous and, in 
the case of secretory leaders, contiguous and in reading frame. Structural elements Intended for use in 
yeast ex;»es8ion systems preferably include a leader sequence enabling extracellular seaetion of trans* 
lated protein by a host ceil. Alternatively, where recombinant protein Is expressed without a leader or 
transport sequence, it may Include an N-terminal mettiipnine residue. This residue may optionally be 

50 subsequentiy cleaved from the expressed recombinant protein to provide a final product 

ONA sequences encoding mammalian TNF receptors which are to be expressed in a microorganism 
will preferably contain no introns that coukl prematurely terminate transcription of DNA into mRNA; 
however, immature tenmination of transcription may be desirable, fbr example, where It wouki result in 
mutants having advarrtageous C-termlnal truncations, for example, deletion of a transmembrane region to 

55 yield a solut)to receptee not bound to the cell membrane. Due to code degeneracy, ttiere can be 
considerable variation in nucleotide sequences encoding the same amino add sequence. Other embodi- 
ments include sequences capable of hybridizing to the sequences of ttie provided cONA i^ider moderately 
stringent conditions (50* C. 2x BSC) and other sequences hybridizing or (^generate to those which encode 
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biologically active TNF receptor polypeptides. 

Recombinant TNF-R DNA is expressed or amplified in a recombinant expression system comprising a 
substantially homogeneous monoculture of surtabie host microorganisms, for example, bacteria such as £ 
coll or yeast such as 5. cerevlslae, which have stably integrated (by transformation or transfection) a 
5 recombinant transcriptional unit Into chromosomal DNA or carry the recombinant transcriptional unit as a 
component of a resident ptasmid. Generally, cells constituting the system are the progeny of a single 
ancestral transformant Recombinant expression systems as defined herein will express heterologous 
protein upon induction of the regulatory elements linked to the DNA sequence or synthetic gene to be 
expressed. 

10 Transformed host cells are cells which have been transformed or transfected with TNF-R vectors 
constructed using recombinant DMA techniques. Transformed host cells ordinarily express TNF-R. but host 
ceils transformed for purposes of cloning or amplifying TNF-R DNA do not need to express TNF-R. 
Expressed TNF*R wilt be deposited in the cell membrane or secreted Into the culture supernatant 
depending on the TNF-R DNA selected. Suitable host cells for expression of mammalian TNF-R include 

75. prokaryotes. yeast or higher eukaryotic cells under the control of appropriate promoters. Prokaryotes 
include gram negative or gram positive organisms, for example £. cofi or bacilli. Higher eukaryotic cells 
include established cell lines of mammalian origin as described below. Cell-free translation systems could 
also be employed to produce mammalian TNF-R using RNAs derived from the DNA constnjcts of the 
present invention. Appropriate cloning and expressk^ vectors for use with bacterial, fungal, yeast, and 

20 mammalian cellular hosts are described by Pouwels et al. (Cioning Vectors: A Laboratory Manual, 
Elsevier. New Yoric. 1985), the relevant disclosure of which is hereby incorporated by reference. 

Prokaryotic expression hosts may be used for expression of TNF-R that do not require extensive 
proteolytic and disulfide processing. Prokaryotic expression vectors generally comprise one or more, 
phenotypic selectable markers, for example a gene encoding proteins confening antibiotic resistance or 

25 supplying an autotrophic requirement, and an origin of replication recognized ty the host to ensure 
amplification within the host Suitable prokaryotic hosts for transformation include £ coll, Bacillus subtltts. 
Salmonella typhfmurfum, and various species within the genera Pseudomonas, Streptomyces, and 

Staphyolococcus, although others may ais^be empkjyed as a matter of choice. - ^ 

Useful expression vectors for bacterial use can comprise a selectable marker and bacterial origin or 

3C replication derived from commerctalty available plasmlds comprising genetic elements of the well known 
cloning vector pBR322 (ATCC 37017). Such commercial vectors include, for example. pKK223-3 
(Pharniada Rne Chemicals, Uppsala, Sweden) and pGEMI (Promega Biotec, Madison, Wl, USA). These 
pBR322 "backbone" sections are combined with an appropriate promoter and tiie structural sequence to be 
expressed. £ coll Is typically transfomned using derivativee-of pBR322. a plasmid derived from an £ coll 

35 species (Bolivar et al.. Gene 2:95, 1977). pBR322 contains genes for ampictllin and tetracycline resistance 
and thus provides simple means for identifying transformed cells. 

Promoters commonly used in recombinant microbial expression vectors include the /3-lactamase 
(penicillinase) and lactose promoter system (Chiang et al.. Nature 275:615, 1978; and Qoeddel et al.. 
Nature 281:544, 1979), the tryptophan (trp) promoter system (Goeddel et al.. NucL Acids Res, 5:4057. 

40 1980; and EPA 36,776) and tac promoter (Maniatis. Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laboratory, p. 412, 1982). A particularty useful bacterial expression system employs the phage X Pt 
promoter and cl857l8 thermolabile repressor. Plasmid vectors available from the American Type Culture 
Cksllection which incorporate derivatives of tiw X Pl promoter include plasmid pHUB2. resident In £ coll 
strain JMB9 (ATCC 37092) and pPLc2a, resident in £ coll RR1 (ATCC 53082). 

45 Recombinant TNF-R proteins may also be expressed in yeast hosts, preferably from the Sac- 
charomyces spedes, such as S. cerevlslae. Yeast of other genera, such as PIchIa or Ktuyveromyces 
may also be emptoyed. Yeast vectors will generally contain an origin of repDcation from the 2u yeast 
plasmkJ or an autonomously replicating sequence (AR3). promoter, OfiA encoding TNF-R, sequences for 
polyadenylation and transcription termination and a selection gene. Preferably, yeast vectors will include an 

so origin of repncation and selectable marker penmitting transformation of both yeast and £ coll, e.g.. the 
ampidllin resistance gene of £ coll and S. cerevlslae TRP1 or URA3 gene, which provides a selection 
martcer for a mutant strain of yeast lacking the aWniy to grow in tryptophan, and a promoter derived from a 
highly expressed yeast gene to induce transcription of a structural sequence downstream. The presence of 
the TRP1 or URA3 lesion in the yeast host cell genome then provides an effective environment for 

5S (tetecting transformation l)y^ growth In the absence of tryptophan or uradl. 

Suitable promote s^^uences In yeast vectors Include the promoters for metaltathioneln, 3- 
phosphoglycerate kinase (Hitreman et al., Jl Btol Chem, 25S-.2073, 1980) or other glycolytic enzymes 
(Hess et aL, J. Adv. Enzyme Reg. 7:149, 1968; and Holland et aL, Bioehem. /7:4900, 1978), such as 
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enolase. glyc8raldehyde-3-phosphate dehydrogenase, hexoWnase, pyruvate decartwxylase. phosphofruo 
tokinase, glucose-6-phosphate isomerase. 3-phosphoglycerate mutase, pyruvate kir^e. triosephosphate 
isomerase, phosphoglucose isomerase, and glucokinase. Suitable vectors and promoters for use in yeast 
expression are further described in R. Httzeman et al.. EPA 73,657. 

5 Preferred yeast vectors can be assembled using DMA sequences from pUCIS for selection and 
replication in £ coff (Amp' gene and origin of replication) and yeast ONA sequences including a giucose- 
represslble A0H2 promoter and a-factor secretion leader. The A0H2 promoter has been described by 
Russell et al. (J. Biol. Chem. 25ft2674, 1982) and Beier et al. {Nature 300724, 1982). The yeast o-factor 
leader, which directs secretion of heterologous proteins, can be Inserted between the promoter and the 

10 structural gene to be expressed. See, d.?.. Kurjan et al.. Ceii 30'.933. 1982; and Bitter et al., Proc. NatL 
Acad ScL USA 81:5330, 1984. The leader sequence may be modified to contain, near its 3' end. one or 
more useful restriction sites to facilitate fusion of the leader sequence to foreign genes. 

Suitable yeast transformation protocols are known to ttiose of skill In ttie art; an exemplary technique is 
described by HInnen et al., Proc. NatL Acad, ScL USA 75:1929, 1978, selecting for Trp* transformants in a 

t5 selective medium consisting of 0.67% yeast nitrogen base, 0.5% casamino adds, 2% glucose. 10 ug/mt 
adenine and 20 ug/ml uradi or URA-t* tranformants In medium consisting of 0.67% YNB, with amino adds 
and bases as described by Sherman et aL. Laboratory Course Manuai for Methods fn Yeast Genetics, 
Cold Spring Hartx}r Laboratory, CoU Spring Hartxx, New York. 1988. 

Host strains transformed by vectors comprising the A0H2 promoter may be grown for expression in a 

20 rich medium consisting of 1% yeast extract 2% peptone, and 1% or 4% glucose supplemented with 80 
ug/ml adenine and 80 ug/ml uradl. Derepression of the A0H2 promoter occurs upon exhaustion of nrtedium 
glucose. Crude yeast supematants are harvested by filtration and held at 4* C prior to hirther purification. 

Various mammalian or insect cell culture systems are also advantageously employed to express, 
recombinant protein. Expression of recombinant proteins in mammalian cells Is particularty preferred 

25 because such proteins are generally conectiy folded, appropriately modified and completely functional. 
Examples of suitable mammalian host cell Ones include ttie COS-7 Ones of nrrankey kidney cells, described 
by Qluzman (Ceil 23:175. 1981). and other cell Ones capable of expressing an appropriate vector Including, 
for example, L cells, CI 27. 3T3, Chinese^bamster ovary_(CKO), HeLa and BHK cell lines. Mamm agan 
expression .vectors may comprise nontranscribed elements such as an origin of replication, a suitable 

JO promoter and enhancer linked to the gene to be expressed, and other s' or 3' flanking nontranscribed 
sequences, and 5' or 3' nontranslated sequences, such as necessary ribosome binding sites, a 
polyadenylation site, splice donor and acceptor sites, and transcriptional termination sequences. Baculovirus 
systems for production of heterologous proteins In insect cells are reviewed by Luckow and Summers. 
Bio/Technology 6:47 (1988). 

35 The transcriptional and translational cc^trol sequences in expression vectors to be used in transforming 
vertebrate cells may be provided by viral sources. For example, commonly used promoters and enhancers 
are derived from Polyoma, Adenovirus 2, Simian Virus 40 (SV40), and human cytomegalovirus. ONA 
sequences derived from the SV40 viral genome, for example, SV40 origin, eariy and late promoter, 
enhancer. spGce, and polyadenylation sites may be used to provide the ottier genetic elements required for 

40 expression of a heterologous ONA sequence. The eariy and late promoters are particularly useful because 
botti are obtained easily from ttie virus as a fragment which also contains ttie SV40 viral origin of replication 
(Rers et al.. ^afura 273:113, 1978). Smaller or larger SV40 fragments may also be used, provided ttie 
approximately 250 bp sequence extending from ttie Hind 3 site toward ttie Bgh site kx:ated In ttte viral 
origin of repOcation is included. Furttier, mammalian genomic TNF-R promoter, control and/or signal 

45 sequences may be utilized, provided such control sequences are compatible witti ttie host celt chosen. 
Additional details regarding the use of a mammalian high expression vector to produce a recombinant 
mammalian TNF receptor are provided In Examples 2 and 7 betow. Exemplary vectors can be constructed 
as diseased by Okayama and Berg (MoL CelL Biol. 3'.280, 1983). 

A useful system for stable high level expression of mammaOan receptor cONAs In C127 murine 

50 mammary epftheSal cells can be constructed substantially as described by Cosman et al. {MoL bnmimoL 
23:935, 1986). 

In preferred aspects of ttie present invention, recombinant expression vectors comprising TNFR cDNAs 
are stably Integrated into a host ceiTs ONA Bevated levels of expression product Is achieved by selecting 
for cell lines having ampGfied numbers of vector ONA. Cell Ones having amplified numt)er3 of vector ONA 
55 are selected, for example, by transforming a host cell witti a vector comprising a ONA sequence which 
encodes an enzyme whidi is Inhibited by a known drug. The vector may also comprise a ONA sequence 
which encodes a desired protein. Alternatively, the host cell may be oo-transfonmed witti a second vector 
which cornprises ttie ONA sequence which encodes ttie desired protein. The transformed or co-transformed 
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host cells are then cultured in increasing concentrations of the known drug, thereby selecting for drug- 
resistant ceils. Such drug-resistant cells sunnve in increased concentrations of the toxic drug by over- 
production of the enzyme which is inhibited by the dojg, frequently as a resutt of amplification of the gene 
encoding the enzyme. Where drug resistance is caused by an increase in the copy number of the vector 

5 DNA encoding the inhibrtable enzyme, there is a concomitant co-amplification of the vector ONA encoding 
the desired protein (TNF-R) In the host cell's DNA. 

A preferred system for such co-amplification uses the gene for dihydrofolate reductase (DHFR), which 
can be inhibited by the drug methotrexate (MTX). To achieve co-ampiification, a host cell which lacks an 
active gene encoding DHFR is either transfomned with a vector which comprises ONA sequence encoding 

to DHFR and a desired protein, or Is co-transformed with a vector comprising a DNA sequence encoding 
DHFR and a vector comprising a DNA sequence encoding the desired protein. The transformed or co- 
transformed host cells are cultured in media containing increasing levels of MTX and those cells lines 
which survive are selected. 

A particularly preferred co-amplification system uses the gene for glutamlne synthetase (QS). which is 

IS responsible for the synthesis of glutamate and ammonia using the hydrolysis of ATP to AOP and phosphate 
to drive tiie reaction. QS Is subject to Inhibition by a variety of inhibitors, for example methionine 
sulphoximlne (MSX). Thus, TNF-R can be expressed In high concentrations by co-amplifying cells 
transformed with a vector comprising the DNA sequence for QS and a desired protein, or co-transformed 
with a vector comprising a DNA sequence encoding GS and a vector comprising a DNA sequence encoding 

20 the desired protein, culturing the host cells in media containing increasing levisis of MSX and selecting for 
surviving cells. The GS co-amplification system, appropriate recombinant expression vectors and cells lines, 
are described in the following PCT appHcations: WO 87/04462, WO 89A)1036. WO 89/10404 and WO 
86A)5807. 

Recombinant proteins are preferably expressed by co-amplification of DHFR or GS in a mammalian 
25 host cell, such as Chinese Hamster Ovary (CHO) celts, or alternatively in a murine myeloma cell line, such 
as SP2/0-Ag14 or NSO or a rat myeloma cell line, such as YB2/3.0-Ag20. disclosed in PCT applications 
WO/89/10404 and WO 86/05807. 

A preferred eukaryotic vector for expwtsion of TNF-R^DNA Is disclosed below In Example 2^ This 
vector, refenwJ to as pCAV/NOT, was derived from the mammaflin high expression vector pDC20l an^ 
30 contains regulatory sequences from SV40, adenovirus-2, and human cytomegalovirus. 



Purification of Recombinant TNF-R 

36 Purified mammalian TNF receptors or analogs are prepared by culturing suitable hostA^ector systems to 
express the recombinant translation products of the DNAs of the present Invention, which are then purified 
from culture media or cell extracts. 

For example, supematants from systems which secrete recombinant protein into culture media can be 
first concentrated using a commercially available protein concentration filter, for example, an Amicon or 

40 Millipore Pellicon ultrafiltration unit Following the concentiation step, the concentrate can be applied to a 
suitable purification matrix. For example, a suitable affinity matrix can comprise a TNF or lectin or antibody 
molecule bound to a suttabte support Alternatively, an anion exchange resin can be employed, for example, 
a matrix or substrate having pendant diethylaminoethyl (OEAE) groups. The matrices can be acrylamide. 
agarose, dextran, cellulose or other types commonly employed In protein purification. Alternatively, a cation 

45 exchange step can be employed. Suitable cation exchangers include various insoluble matrices comprising 
sulfopfopyl or carboxymethyl groups. Sulfopropyl groups are preferred. 

FinaOy, one or morB reversed-phase high performance liquid chromatography (RP-HPLC) steps employ- 
ing hydrophobic RP-HPLC media, 6.g., silica gel having pendant methyl or other aliphatic groups, can be 
employed to further purify a TNF-R composition. Some or all of the foregoing purification steps, in various 

so combinations, can also be employed to provide a homogeneous recombinant protein. 

Recombinant protein produced In bacterial culture Is usually isolated by initial extraction from cell 
pellets, followed by one or more concentration, salting-out aqueous ion exchange or size exclusion 
chromatography steps. Rnally, high perfamance Oquid chromatography (HPLC) can be employed for final 
purification steps. Microbial cells employed In expression of recombinant mammalian TNF-R can be 

68 disrupted by any convenient method, including freeze-thaw cycOng. sonlcation. mechanical disruption, or 
use of cell tysing agents. 

Fermentation of yeast which express mammafian TNF-R as a secreted protein greatiy Amplifies 
purification. Secreted recombinant protein resulting from a large-scale fermentation can be purified by 
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methods analogous to those disclosed by Urdal et al. (J. Chromatog, 2S&171, 1984). This reference 
describes two sequentiaJ, reversed-phase HPLC steps for purification of recombinant human GM-CSF on a 
preparative HPLC column. 

Human TNF-R synthesized (n recombinant culture is characterized by the presence of non-human cell 
5 components, including proteins, in amounts and of a character which depend upon the purification steps 
taken to recover human TNF-R from the culture. These components ordinarily will be of yeast, prokaryotic 
or non*human higher eukaryotlc origin and preferably are present in innocuous contaminant quantities, on 
the order of less than about 1 percent by weight. Further, recombinant cell culture enables the production of 
TNF*R free of proteins which may be normally associated with TNF-R as it Is found in nature in its species 
10 of origin, e.g. in cells, cell exudates or body fluids. 



Therapeutic Administration of Recombinant Soluble TNFR 

fS The present Invention provides mettiods of using therapeutic compositions comprising an effective 
amount of soluble TNFR proteins and a suitable diluent and canrier, and methods for suppressing TNF- 
dependent Inflammatory responses in humans comprising administering an effective amount of soluble 
TNF-R protein. 

For therapeutic use, purified soluble TNF-R protein Is administered to a patient, preferably a human, for 

20 treatment In a manner appropriate to ttie indication. Thus, for example, soluble TNF-R protein compositions 
can be administered by bolus Injection, continuous Infusion, sustained release from implartts, or otiier 
suitable technique. Typically, a soluble TNF-R therapeutic agent will be administered in the form of a 
composition comprising purified protein in conjunction with physiologically acceptable carriers, exciplents or 
diluents. Such carriers will be nontoxic to recipients at the dosages and concentrations emptoyed. 

25 Ordinarily, the preparation of such compositions entails combining the TNF-R witii buffers, antioxidants such 
as ascorbic acid, low molecular weight (less ttian about 10 residues) polypeptides, proteins, amino adds, 
carbohydrates including glucose, sucrose or dextrine, chelating agents such as EDTA, glutatiilone and other 
stabilizers and exciplents. Neutral buffered^saline or sailne mixed with conspecific serum albumin are 
exemplary appropriate diluents. Preferably, product is Tormufaed as a lyophilizate using appropn3l3~ 

30 exdpient solutions (e.g., sucrose) as diluents. Appropriate dosages can be determined in trials. The amount 
and frequency of administration will depend, of course, on such factors as the nature and severity of the 
indication being treated, tiie desired response, tiie condition of the patient, and so forth. 

Soluble TNF-R proteins are administered for the purpose of Inhibiting TNF-dependent responses. A 
variety of diseases or conditions are believed to bacauaed by TNF, such as cachexia and septic shock. In 

35 addition, other key cytokines (IL-1. IL-2 and otiier colony stimulating factors) can also induce significant host 
production of TNF. Soluble TNF-R compositions may therefore be used, for example, to treat cachexia or 
septic shock or to treat side effects assodated witi) cytokine tiierapy. Because of the primary rotes lL-1 and 
lL-2 play in the production of TNF, combination tiierapy using both IL-1 receptors or tL-2 receptors may be 
preferred in tiie treatment of TNF-assodated clinical indications. 

40 The following examples are offered by way of Illustration, and not by way of limitation. 



EXAMPLES 

45 

Example 1 



so 

Binding Assays 



A. RadioiBbeHng of TNFa and TNFfi, Recombinant human TNFa. in the form of a fusion protein 
55 containing a hydrophillc octapeptide at ttte N-temnlnus. was expressed In yeast as a secreted protein and 
purified by affinity chromatography (Hopp et al., Bio/Technofpgy ft1204, 1888). Purffied recombinant 
human JUFff was purchased frcmi R&O Systems (MinneapoOs. MN). Both proteins were radiolat)eled 
using the commerdaily available solid phase agent, lOOO-QEN (Pierce). In ttiis procedure. 5 tig of 
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lOOO-GEN were plated at the bottom of a 10 x 75 mm glass tube and incubated for 20 minutes at 4* C 
with 75 ul of 0.1 M sodium phosphate, pH 7.4 and 20 ul (2 mCl) Na ^^\. This solution was then 
transferred to a second glass tube containing 5 ug TNFa (or TNF^) in 45 ul PBS for 20 minutes at 4* C. 
The reaction mixture was fractionated by gel filtration on a 2 ml t>ed volume of Sephadex 6*25 (Sigma) 

5 equilibrated in Roswell Park Memorial Institute (RPMt) 1 640 medium containing 2.5%^ (w/v) bovine 
serum albumin (BSA), 0.2% (w/v) sodium azide and 20 mM Hepes pH 7.4 (binding medium). The final 
pool of ^^l-TNF was diluted to a working stock solution of 1 x 10"^ M In binding medium and stored for 
up to one month at 4* C without detectable loss of receptor binding activity. The specific activity is 
routinely 1x10* cpm/mmoie TNF. 

10 6. Binding to intact Ceils. Binding assays with Intact cells were performed by two methods. In the first 
method, celts were first grown either in suspension (e.g.. U 937) or by adherence on tissue culture plates 
(e.gM W126*VA4, COS cells expressing the recombinant TNF receptor). Adherent cells were subsequentiy 
removed by treatment with SmM EDTA treatment for ten minutes at 37 degrees centigrade. Binding 
assays were then performed by a pthalate oil separation method (Dower et al., J, ImmunoL /32:751. 

15 1984) essentially as described by Park et al. (J. BloL Chem. 2ff/:4177. 1986). Non-specific binding of 
*^I-TNF was measured in the presence of a 200-fbkj or greater molar excess of unlabeled TNF. Sodium 
azide (0.2%) was included in a binding assay to inhibit internalization of ^^l*TNF by cells. In the second 
method. COS cells transfected with tfie TNF*R-contalning plasmid, and expressing TNF receptors on the 
surface, were tested for the abinty to bind ^^l-TNF by the plate binding assay described by Sims et al. 

20 (Science 24 7:585. 1 988). 

C. Soild Phase Binding Assays. The ability of TNF-R to be stably adsort>ed to nitrocellulose from 
detergent extracts of human cells yet retain TNF*binding activity provided a means of detecting TNF-R. 
Cell extracts were prepared by mixing a cell pellet with a 2 x volume of PBS containing 1% Triton X-1pO 
and a cocktail of protease Inhibitors (2 mM phenytmetiiyl sulfonyl fluoride, 10 uM pepstatin. 10 uM 

25 leupeptin. 2 mM o-phenanthrolino and 2 mM EQTA) by vigorous vortexing. The mixture was Incutmted 
on ice for 30 minutes after which it was centrifuged at 12.000x g for 15 minutes at 8* C to remove nuclei 
and other debris. Two micfoilter allquots of cell extracts were placed on dry BA85/21 nitrocellulose 
membranes (Schleicher and Schuelt. Ke^he, NH) and allpwed to dry. The membranes were incubate d in 
tissue culture dishes for 30 minutes In Tris (0.05 M) buffered saline (0.15 M) pH 7.5 containing 3% wTv 

30 BSA to block nonspecific binding sites. The membrane was then covered with 5x10"" M '=*I-TNF in 
PBS ^ 3% BSA and incubated for 2 hr at 4* C with shaking. At the end of this time, the membranes 
were washed 3 times in PBS. dried and placed on Kodak X<Omat AR film for 18 hr at -70* C. 

Example 2-^- 

35 



Isolation of Human TNF*R cDNA by Direct Expression of Active Protein In 008*7 Cells 

40 

Various human cell Ones were screened for expression of TNF-R based on their ability to bind ^^1- 
labeled TNF. The human fibroblast cell line Wl-26 VA4 was found to express a reasonable number of 
receptors per cell. Equilibrium binding stiKSies showed that the cell line exhibited biphasic binding of ^^1- 
TNF with approximately 4,000 high affinity sites (K, » 1 x 10'® M*') and 15.00 low affinity sites (K. « 1 x 
45 10" M"*)perceIL 

An unsized cONA ffbrary was constructed by reverse transcription of polyadenylated mRNA Isolated 
from total RNA extracted from human fibroblast Wl-26 VA4 cells grown in the presence of pokeweed 
mitogen using standard techniques (Gubier. et al., Gene 26rJ2SA. 1983; Ausut)el et al.. eds.. Oarent 
Protocols In Molecular Biology, Vol. 1. 1987). The cells were han/ested by lysing the cells in a guanidine 
50 hydrochtoride solution and total RNA isolated as previously described (March et al.. Nature 3/£641, 1985). 
Poly A* RNA was isolated by ollgo dT cellulose diromatography and double-stranded cONA was 
. prepared by a method similar to that of Qubier and Hoffman (Gene 2fi263. 1983). Briefly, the poly A* Rl^ 
was converted to an RNA-cpNA hybrid by reverse transcriptase using otigo dT as a primer. The RNA^NA 
hybrid was tiien converted into double-stranded cDNA using RNAase H in combination with DNA poly- 
66 merase I. The resulting .cikxible stranded cONA was blunt-ended with T4 ONA polymerase. To the blunt- 
ended cONA Is added EcoRi linker-adapters (having internal A/oft sites) which were phosphorytated on only 
one end (Invitrogen). The tinker-adaptered cDNA was treated with T4 polynucleotide kinase to 
phosphorylate the 5' overhanging region of the linker-adapter and unligated linkers were removed by 
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running the cDNA over a Sepharose CL4B column. The linker-adaptered cONA was ligated to an equimoiar 
concentration of Ecoffl cut and dephosphorylated amis of bacteriophage XgtIO (Huynh et aJ, DNA Qonlng: 
A PractlCBi Approach, Glover, ed.. IRL Press, pp, 49-78). The ligated DNA was packaged into phage 
particles using a commercially available kit to generate a library of recombinants (Stratagene Cloning 

5 Systems. San Diego. CA. USA). Recombinants were further amplified by plating phage on a bacterial lawn 
of £. CC// strain c600(hfl"). 

Phage DNA was purified from the resulting Xgt10 cDNA library and the cONA Inserts excised by 
digestion with the restriction enzyme Not\, Following electrophoresis of the digest through an agarose gel, 
cDNAs greater than 2,000 bp were isolated. 

10 The resulting cDNAs were ligated into the eukaryotic expression vector pCAV/NOT, which was 
designed to express cONA sequences inserted at Its multiple cloning site when transfected into mammalian 
cells. pCAV/NOT was assembled from pDC201 (a derivative of pMLSV. previously described by Cosman et 
al.. Nature 312: 768. 1984), SV40 and cytomegalovirus DNA and comprises, in sequence with the direction 
of transcription from the origin of replication: (1) SV40 sequences from coordinates 5171-270 Including the 

IS origin of replication, enhancer sequences and early and late promoters; (2) cytomegalovirus sequences 
including the promoter and enhancer regions (nucleotides 871 to -(*63 from the sequence published by 
Boechart et al. (Cell 4/:521. 1985); (3) adenovtrus-2 sequences containing ttie first exon and part of tiie 
intron between tiie first and second exons of the tripartite leader, the second exon and part of the third exon 
of the tripartite leader and a multiple cloning site (MCS) containing sites for Xhol. Kpnl, Smal. NotI and 

20 eg/1; (4) SV40 sequences from coordinates 4127-4100 and 2770*2533 that Include the polyadenylation and 
termination signals for early transcription; (5) sequences derived from pBR322 and virus-associated 
sequences VAI and VAII of pOC201, witii adenovirus sequences 10532-11158 containing the VAl and VAII 
genes, followed by pBR322 sequences from 4363-2488 and 1094-375 containing the ampidltin resistance 
gene and origin of replication. 

25 The resulting W1-28 VA4 cONA library in pCAV/NOT was used to transfom) E. coli strain DH5a. and 
recombinants were plated to provide approximately 800 colonies per plate and sufficient plates to provkle 
approximately 50.000 total colonies per screen. Colonies were scraped from each plate, pooled, and 
plasmid DNA prepared from each pool. Tnet>ooled DNA was th^ used to transfect a sub-confluenf faye^fiL 
monkey COS-7 cells using DEAE-dextran foltowed by chloroquine treatment as described by Luthman et 

30 al. {Nuci. Adds Res, n:l295. 1983) and McCutchan et al. {d NatL Cancer /nst 4;:351. 1986). The cells 
were then grown In culture for tiiree days to permit transient expression of the inserted sequences. After 
three days, cell culture supematants were discarded and tiie cell monolayers in each plate assayed for TNF 
binding as follows. Three ml of binding medium containing 1.2 x 10"" M '^'Habeled FIAG^-TNF was 
added to each plate and the plates incubated at 4*C fBn20 minutes. This medium was tiien discarded, 

35 and each plate was washed once with coki binding medium (containing no labeled TNF) and twice witin cold 
PBS..The edges of each plate were then broken off, leaving a flat disk which was contacted with X-ray film 
for 72 hours at -70*0 using an intensifying screen. TNF binding activity was visualized on tiie exposed 
films as a daric focus against a relatively uniform background. 

After approximately 240,000 recombinants the library had been screened in this manner, one transfec- 

40 tant pool was observed to provide TNF binding fod which were clearly apparent against tiie background 
exposure. 

A frozen stock of bacteria from tiie positive pool was ttien used to obtain plates of approximately 150 
cotonies. Replicas of these plates were made on nitrocellulose filters, and the plates were then scraped and 
plasmki DNA prepared and transfected as described above to Identify a positive plate. Bacteria from 

45 incfivtdual cok^nlee from the nitrocellutose repUca of this plate were grown in 0.2 ml cultures, which were 
used to obtain plasmkl DNA, which was transfected into COS-7 celts as descrit)ed above. In this manner, a 
single done, clone 1 , was isolated which was capable of Inducing expression of hurrutn TNF-R In COS cells. 
The expression vector pCAV/NOT containing the TNF-R cDNA clone 1 has been deported witti tiie 
American Type Cuttura Collection, 12301 Paridawn Drive, RockviUe, MD 20852. USA (Accession 1^. 68088) 

50 under the name pCAV/NOT-TNF-R, on 6tti Sept 1989. 



Examples 

66 

Construction of cDNAs Encoding Sofuble huTNF-RA235 
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A cDNA encoding a soluble huTNF-RA235 (having the sequence of amino adds 1-235 of Figure 2A) 
was constructed by excising an 840 bp fragment from pCAV/NOT-TNF-R with the restriction enzymes Noti 
and Pvu2. Noti cuts at the multiple cloning site of pCAV/NOT-TNF-R and Pvu2 cuts within the TNF-R 
coding region 20 nucleotides s' of the transmembrane region. In order to reconstruct the 3' end of the TNF* 
R sequences, two oligonucleotides were synthesized and annealed to create the following oligonucleotide 
linker 

Pvu2 BaaiHl Bgl2 

CTGAAGGGAGCACTGGCGACZ&&GGATCCA 
GACTTCCCTCGTGACCGCTGATTCCTAGCTCTAG 
Al aGluGly S« rThrOlyAapEad 

This oligonucleotide linker has terminal Pvu2 and Bgl2 restriction sites, regenerates 20 nucleotides of the 
TNF-R. followed by a termination codon (underlined) and a BamHI restriction site (for convenience in 
Isolating the entire soluble TNF-R by Not1 /BamHI digestion). This ollgonucteotide was then ligated vnth the 
840 bp Not1/Pvu2 TNF-R Insert Into Bgl2/Not1 cut pCAV/NOT to ytekJ psomuTNFRA235/CAVNOT. which 
was transfectsd into COS-7 cells as described above. This expression vector induced expression of soluble 
human TNF-R which was capable of binding TNF. 



Example 4 



Construction of cDNAs Encoding Soluble huTNF-RA185 



A cONA encoding a soluble huTNF-RA185 (having the'seqiiShce of amino acids 1-185 of Figure 2Ay — 
was constmcted by excising a 640 bp fragment from pCAV/NOT-TNFR with the restriction enzymes Noti 
and Bgi2. Noti cuts at the multiple cloning site of pCAV/NO-TNF-R and Bgt2 cuts within the TNF-R coding 
region at nucleotide 637. which Is 237 nucleotides 5 of the transmembrane region. The following 
oligonucleotide linkers were synthesized: 

Bgl2 

5 * -GATCTGTAACGTGGTGGCCATCCCTGGGAATGCAAGCATGGATGC-3 * 
ACATTGCACCACCGGTAGGGACCCTTACGTTCG 
IleCysAsnValValAlalleProGlyAanAlaSerMetAspAla 



Not! 

5 • - AGTCTGCACGTCCACGTCCCCCACCCGGlfiAGC -3 • 

TACCTACGTCAGACGTGCAGGTGCAGGGGGTGGGCCACTCGCCGG 
ValCysThrSerthrSerProThrArgEnd 

The above digonudeoUde linkers reconstruct ttie 3' end of the receptor molecule up to nucleotide 708. 
followed by a termination codon (underlined). These oOgonucleotides were ttien ligated with ttw 840 bp 
Not! TNF-R insert Into Noti cut pCAV/NOT to yiekl the expression vector psorrNFRA18S/CAVNOT, whteh 
was transfected bito COS-7 cells as described above. This expression vector induced expression of soluble 
human TNF-R which was capable of binding TNF. 



Example 8 



Constniction of cDNAs Encoding Soluble huTNF-RA183 
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A cONA encoding a soluble huTNF-RAie3 (having the sequence of amino acids 1-163 of Figure 2A) 
was constriicted by exdsing a 640 bp fragment from from pCAV/NOT-TNFR with the restriction enzymes 
Noti and Bgt2 as described in Example 4. The following oligonucleotide linkers were synthesized: 

5 Bgl2 Motl 

5 • •GATCTGTISAGC - 3 • 

ACAACTCGCCGG 
IleCysEnd 

This above oligonucleotide linker reconstructs the 3 end of the receptor molecule up to nucleotide 642 
(amino add 163). followed by a termination codon (underlined). This oligonucleotide was then llgated with 
the 640 bp Noti TNF-R insert into Noti cut pCAV/NOT to yieW the expression vector 
psoITNFRAieS/CAVNOT, which was transfected Into COS-7 cells as described above. This expression 
,5 vector induced expression of soluble human TNF-R which was capable of binding TNF in the binding assay 
described In Example 1. 



Example 6 

20 



Construction of cDNAs Encoding Soluble huTNF-RAI 42 



A cDNA encoding a soluble huTNF-RA142 (having the sequence of amino acids 1-142 of Rgure 2A) 
vras constructed by exdsing a 550 bp fragment from from pCAV/NOT-TNF-R with the restriction enzymes 
Noti and AJwNI. AlwNI cuts within, ^Ihf THF-R coding region at nudeotide 549. The foUowing 
oligonucleotide linker was synthesized: - - — 

30 

B9I2 Noti 
5 * K:TGAAACATCAGACGTGGTGTGCAAGCCCTGTZa2A*3 ' 
CTTGACTTTGTAGTCTGCACCACACGTTCGGGACAATTTCTAGA 

^ End 

33 

This above oligonucleotide linker reconstructs tiie 3' end of the receptor molecule up to nucleotide 579 
(amino add 142). folkmed by a termination codon (underiined). This oligonucleotide was then ligated with 
the 550 bp Notl/AIwNI TNF-R insert Into Not1/Bgl2 cut pCAV/NOT to yiekl the expression vector 
^ psoiTNFRA142/CAVNOT. which was transfected into COS-7 celts as described above. This expression 
vector (M not induced expression of soluble human TNF-R which .was capable of binding TNF. It is 
believed that tttis particular construct failed to express biologically active TNF-R because one or more 
essential cysteine residue (e.g., Cys'*' or Cys**^ required for intramolecular bonding (for formation of the 
proper tertiary structure of ttie TNF-R molecule) was eliminated. 

45 

Example 7 



Expression of Soluble TNF Receptors in CHO Cells 



Soluble TNF receptor was expressed in Chinese Hamster Ovary (CHO) cells using the glutamlne- 
synthetase (QS) gene ampTification system, substantially as described in PCT patent application Nos. 
W0874)4462 and WO8da>1036. Brfefty, CHO cells are transfected with an expression vector containing 
genes for both TNF-R and GS. CHO cells are selected for QS gene expression based on the ability of the 
transfected ONA to confer resistance to k>w levels of methionine sulphoximine (MSX). GS sequence 
amplification events tn such celts are selected using elevated MSX concentrations In this way. contiguous 
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TNF-R sequences are also amplified and enhanced TNF-R expression is achieved. 

The vector used In the GS expression system was psolTNFR/Pe/PSVLGS, which was constructed as 
follows. Rrst. the vector pSVLGS.1 (described in PCT Application Nos. W087A)4462 and WO89A}103e. and 
availat}te from Celttech, Ltd.. Berkshire. UK) was cut with the BamHI restriction enzyme and de- 

5 phosphorylated with calf Intestinal alkaline phosphatase (ClAP) to prevent the vector from religating to itself. 
The BamHI cut pSVLQS.1 fragment was then ligated to a 2.4 kb BamHI to Bgl2 fragment of pEE6hCMV 
(described in PCT Application No. WG89/01036. also available from Ceiltech) which was cut witii Bgl2, 
BamHI and Fspl to avoid two fragments of similar size, to yield an 11.2 kb vector designated 
p6/PSVLGS.1. pSVLGS.1 contains tiie glutamine synthetase selectable maricer gene under control of the 

10 SV40 later promoter. The BamHI to Bgl2 fragment of pEEOhCMV contains the human cytomegatovinjs 
major immediate eariy promoter (hCMV). a polylinker. and the SV40 eariy pclyadenyjation signal. The 
coding sequences for soluble TNF-R were added to p6/PSVLGS.1 by excising a Noti to BamHI fragment 
from the expression vector psolTNFR/CAVNOT (made according to Example 3 above), blunt ending witii 
Klenow and ligating with Smal cut dephosphorylated pS/PSVLGS.I, thereby placing the solTNF-R coding 

IS sequences under the control of the hCMV promoter. This resulted in a single ptasmid vector in which the 
SV4a/GS and hCMB/solTNF-R transcription units are transcribed in opposite directions. TTiis vector was 
designated psolTNFR/P6/PSVLQS. 

psolTNFR/P6/PSVLGS was used to transfect CHOKI cells (available from ATCC, Rochville, MD, under 
accession number CCL 61) as follows. A rnonolayer of CH0-K1 cells were grown to subconfluency in 

20 Minimum Essential Medium (MEM) 10X (Gibco: 330-1 581 AJ) without glutamine and supplemented witii 
10% dialysed fetal bovine serum (Gibco: 220-6300AJ), 1 mM sodium pyruvate (Sigma), MEM non-essential 
amino acids (Gibco: 320-1 140AQ), 500 uM asparagine and gtutamate (Sigma) and nucleosides (30 uM 
adenosine, guanosine, cytidine and uridine and 10 uM thymldlneXSIgma). 

Approximately 1x10* cells per 10 cm petri dish were transfected wrth 10 ug of psolTNFR/P6/PSVLQ^~ 

25 by standard caldum phosphate precipitation, substantially as described by Graham & van der Eb, Virology 
52:456 (1983). Cells were subjected to glycerol shock (15% glycerol in serum-free culture medium for 
approximately 1.5 minutes) approximately 4 hours after transfectton, substantially as described by Frost & 
Williams, Virology 91:39 (1978), and then vmhed with serum-free medium. One day later, transfected cells 
were fed witii fresh selective medium containing MSX at allnaJ fiShcentration of 25 uM. Colonies of MSX* 

30 resistant surviving cells were visible witiiin 3-4 weeks. Surviving colonies were transferred to 24-weIl plates 
and allowed to grow to confluency in selective medium. Conditioned medium from confluent wells were 
then assayed for soluble TNFR activity using ttie binding assay described in Example 1 above. These 
assays indicated that tiie colonies expressed biok>gically active soluble TNF-R. 

In order to select for GS gene amplification,. sevesaL^SX-resistant cell lines are transfected witii 

35 psolTNFR/P6/PSVLGS and grown in various concentrations of MSX. For each cell line, approximately 1x10^ 
cells are plated in gradually increasing concentrations of 100 uM, 250 uM, 500 uM and 1 mM MSX and 
incubated for 10-14 days. After 12 days, colonies resistant to tiie higher levels of MSX appear. The 
surviving colonies are assayed for TNF-R activity using tiie binding assay described above in Example 1. 
Each of tiiese highly resistant cell lines contains cells which arise from multiple independent amplification 

40 events. From these cells lines, one or more of the most highly resistant cells lines are isolated. The 
amplified cells with high production rates are tiien cloned by nmiting dilution cloning. Mass cell cultures of 
the transfectants secrete active soluble TNF*^ 



45 Example 8 



Expression of Soluble Human TNF-R In Yeast 

50 

Soluble human TNF-R was expressed In yeast witii tiie expression vector pDCY432, which was derived 
from tiie yeast expresskxi vector pDCyi20 and plasmid pYEP352. piXY120 is Identical to pYdHuGM (ATCC 
53157), except tiiat it contains no cONA Insert and includes a polyfinker/multiple cloning site' with a Nool 
55 restriction site. 

A DNA fragment encoding TNF receptor and suitable for cloning Into tiie yeast expres^on vector 
plXY120 was first generated by polymerase chain reaction (PCR) ampllftoation of tiie exb^llular portion of 
tiie full lengtii receptor from pCAV/NOT-TNFR (ATCC 66088). The following primers were used In tills PCR 
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ampfiffcation: 



fEnd Primer 

^ 5 • -TTCCGGTACCTTTGGATAAAAGAGACTACAAGGAC 

A3p7 1 8 ->ProLeuA3pLy 3ArgA3pTyrLysAap 

GACGATGACAAGTTGCCCGCCCAGGTGGCATTTACA-3 • 
AspAapAspLy3<— TNF-R~ ^> 

10 

3' End Primer fantiacna) 

5 • -CCCGGGATCCllAGTCGCCAGTGCTCCCTTCAGCTGGG-3 • 

The 5 end oligonucleotide primer used in the amplification Included an Asp718 restriction site at its 5 end, 
followed by nucleotides encoding the z' end of the yeast a-factor leader sequence (Pro-Leu-Asp-Lys-Arg) 
and tiiose encoding \he 8 amino adds of ttie FLAG® peptide (AspTyrLysAspAspAspAspLys) fused to 

20 sequence encoding ttie 5 end of the mature receptor. The FLAG® peptide (Hopp et al.. Bio/Technology 
5:1204, 1988) Is a highly antigenic sequence which reversibly binds ttie monoclonal antibody Ml (ATCC HB 
9259). The oligonucleotide used to generate ttie 3' end of ttie PCR-derived fragment is ttie antlsense 
strand of DNA encoding sequences which temiinate ttne open reading frame of ttie receptor after nucleotide 
704 of ttie mature coding region (following ttie Asp residue preceding ttie transmembrane domain) by 

25 introducing a TAA stop codon (underiined). The stop codon is ttien followed by a BamHI restriction site. 
The DNA sequences encoding TNF-fl are ttien amplified by PCR substantially as described by Innis et aJ., 
eds.. PCR Protocols: A Guide^ to Methods and Applications (Academic Press. 1990). 

The PCR-derived Dl^ fragment eooqding soluble human TNF-R was subcloned Into ttia yeast 
expression vector plXY120 by digesting ttie PCR-derived DNA'fftgment witti BamHI and Asp718 resfricB9fr 

30 enzymes, digesting pDCY120 witti BamHI and Asp718, and ligating ttie PCR fragment into ttie cut vector In 
vitro witti T4 DNA llgase. The resulting constmctlon (plXY424) fused ttie open reading frame of ttie FLAG«- 
soluble TNF receptor in-frame to ttie complete o-factor leader sequence and placed expression in yeast 
under ttie aegis of ttie regulated yeast alcohol dehydrogenase (ADH2) promoter. Identity of tfie nucleotide 
sequence of ttie soiubte TNF receptor carried in plXY42ijirftti ttiose In cOI^ clone 1 were verified by DNA 

35 sequencing using ttie dideoxynucleotide chain termination mettiod. pIXY424 was ttien transformed into £ 
CO// strain RR1. 

Soluble human TNF receptor was also expressed and secreted In yeast In a second vector This 
second vector was generated by recovering ttie plXY424 plasmld from E, coll and digesting witti BmRI 
and BamHI restriction enzymes to isolate ttie fragment spanning ttie region encoding ttie ADH2 promoter, 
40 ttie a-factor leader, ttie FLAQ«-soluble TNF receptor and ttie stop codon. This fragment was Hgated In vitro 
into EcoRI and BamHI cut plasmld pYEP352 (Hill et ai^ Yeast 2tie3 (1988)). to yield ttie expression 
plasmid plXY4a2. which was transformed into £co// strain RR1 . 

To assess secretion of ttie soluble human TNF receptor from yeast plXY424 was purified and 
Introduced into a diploid yeast strain of S. cerevlslae pCV2 181) by electroporation and selection for 

49 acquisition of ttie plasmld-bome yeast TRP1 * gene on media lacking tryptophan. To assess secretion of ttie 
receptor directed by plXY432. ttie plasmld was Intrcduced into ttie yeast strain FBI 48^ by electroporation 
followed by selection for the plasmid-bome URA3* gene witti growtti on media lacking uradL Overnight 
cultures were grown at 30* C In ttie appropriate selective media. The PB149-6b/|pDCY434 transfbmiants were 
diluted into YEP-1% glucose media and grown at 30* C for 38^ hours- Supematants were prepared by 

50 removal of cells by centrffugation. and fittration of supematants ttirough 0.45U fUters. 

The level of secreted receptor in ttie supematants was determined by lmmuno-dott)lot Briefly, 1 ul of 
supematants. and dilutions of ttie supematants. were spotted onto nittocellulose filters and allowed to dry. 
After blocking non-spedfic protein bending witti a 3% 8SA solution, ttie filters were incubated witti diluted 
Ml anti-FLAG^ antibody, excess antibody was removed by washing and ttien dilutions of horseradl^ 
65 peroxidase conjugated anti-mouse IgQ antibodies were Incubated witti ttie filters. After removal of excess 
secondary antibodies, peroxidase substrates were added and cok>r devek)pment was allowed to proceed for 
approximately 10 minutes prior to removal of the sut>strate solution. 

The anti-FLAQA reactive material found In ttie supematants demonstrated ttiat significant levels of 
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receptor were secreted by twth expression systems. Comparisons demonstrated that the plXY432 system 
secreted approximately 8-16 times more soluble human TNF receptor than the plXY424 system. The 
supematants were assayed for soluble TNFR activity, as described in Example 1 . by their ability to bind 
^25|-TNFa and block TNFa binding. The plXY432 supematants were found to contain significant levels of 
5 active soluble TNF-R. 



Example 9 

10 

Isolation of Murine TNF-R cDNAs 



T5 Murine TNF-R cDNAs were isolated from a cDNA library made from murine 7B9 cells, an antigen- 
dependent helper T cell line derived from C57BU6 mice, by cross-spedes hybridization with a human TNF* 
R probe. The cONA Qbrary was constructed In XZAP (Stratagene. San Diego), substantially as described 
above in Example 2. by isolating polyadenylated RNA from the 7B9 cells. 

A double-stranded human TNF-R cONA probe was produced by excising an approximately 3.5 kb Noti 

20 fragment of the human TNF-R clone 1 and ^ap-iabeOng the cONA using random primers (Boehringer- 
Mannheim). 

The murine cDNA library was amplified once and a total of 900,000 plaques were screened, substan- 
tially as described in Example 2, with the human TNF-R cDNA probe. Approximately 21 positive plaques 
were purified, and the Bluescript plasmids containing EcoRI -tinkered inserts were excised (Stratagene. S^" 
25 Olego). Nucleic acid sequencing of a portion of murine TNF-R ckme 11 indicated that the coding sequence 
of the murine TNF-R was approximately 88% homologous to the corresponding nucleotide sequence of 
human TNF-R. A partial nucleotide sequence of murine TNF-R cONA clone 11 Is set forth In Figures 3A-3B. 

30 Example 10 



Preparation of Monodongl Anybodies to TNF-R 

OS 

Preparations of purified recombinant TNF-R, for example, human TNF-R, or transfected COS cells 
expressing high levels of TNF-R are employed to generate monoclonal antibodies against TNF-R using 
conventional techniques, for example, those disclosed in U.S. Patent 4,411,993. Such antibodies likely to be 
40 useful In interfering with TNF binding to TNF receptors, for example, in ameliorating toxic or other 
undesired effects of TNF. or as cotnponents of diagnostic or research assays for TNF or soluble TNF 
receptor. 

To Immunize mice. TNF-R ImmurK>gen Is emulsified in complete Freund's adjuvant and Injected in 
amounts ranging from 10-100 ug subcutaneously Into Balb/c mice. Ten to twelve days later, the immunized 

45 animals are boosted with additional Immunogen emulsified in incomplete Freund's adjuvant and periodically 
txx>stBd thereafter on a weekly to biweekly immunization schedule. Serum samples are periodically taken 
by retro-orbitaJ bleedng or tail-tip excision for testing by dot-blot assay (antibody sandwich) or EUSA 
(enzyme-finked lnununosort)ent assay). Other assay procedures are also suitable. FbHowing detection of an 
appropriate antibody titer, positfva animals are given an Intravenous injection of antigen In saline. Three to 

so four days later, the animals are sacrificed, splenocytas harvested, and fused to the murine myeloma cell 
line NS1 . HybridOHM ceil lines generated by this procedure are plated in multiple microtltsr plates in a HAT 
selective medium (hypoxanthine, aminopterin. and thymidine) to Inhibit profiferatkMi of non-fused cells, 
myekxna hybrids, and spleen ceO hybrids. 

Hybddoma ctones thus generated can be screened by EUSA for reactivity with TNF-R, for example, by 

65 adaptations of the tachmques disck>sed by Engvall et aL, bnmunoctiem. ft871 (1971) and in U.S. Patent 
4,703,004. Positive done^^are then Injected into the peritoneal cavities of syng^ieic Baib/b mice to produce 
ascites containing high concentrations ^1 mg/ml) of anti-TNF-R monockmai antibody. The resulting 
monock>nal antibody can be purified by ammonium sutfate precipitation folk>wed by gel exclusion 
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chromatography, and/or affinity chromatography based on binding of antibody to Protein A of Staphyk 
COS aureus. 



s Claims 

1. An isolated DNA sequence encoding a biologically active mammalian TNF receptor (TNF«R) protein. 

2. An isolated DNA sequence according to claim 1 , selected from the group consisting of: 

(a) cDNA clones having a nucleotide sequence derived from the coding region of a native mammalian 
10 TNF-R gene: 

(b) DMA sequences capable of hybridization to the clones of (a) under moderately stringent conditions 
(50* C. 2 X SSC) and which encode biologically active TNF-R protein; and 

(c) DNA sequences which are degenerate as a result of the genedc code to the DNA sequences defined 
in (a) and (b) and which encode bloioglcalty active TNF-R protein. 

15 3. An isolated DNA sequence according to claim 1 which encodes a soluble human TNF-R protein. 

4. An isolated DNA sequence according to claim 3. wherein the soluble human TNF-R protein has an amino 
acid sequence comprises the sequence of amino acid residues 1-x of Figure 2A, wherein x is selected from 
the group consisting of amino adds 163-235 

5. An isolated DNA sequence according to claim 3. wherein the soluble human TNF-R protein comprises 
20 the sequence of amino adds 1-235 of Figure 2A 

6. A DNA sequence according to claim 5. wherein amino add residue 48 Is selected from the group 
consisting of He and Thr and amino add residue 118 is selected from the group consisting of Vat and lie. 

7. An isolated DNA sequence according to claim 3, wherein the soluble human TNF-R protein comprises 
the sequence of amino adds 1-185 of Rgure 2A 

25 8. An isolated DNA sequence according to daim 3, wherein the soluble human TNF-R protein comprises 
the sequence of amino adds 1*163 of Rgure 2A 

9. A recombinant expression vector comprising a DNA seqi^nce according to any one of claims 1*8. 

10. A process for preparing a biological^ ective mammallaa JNF receptor (TNF-R) protein, com'Drf sinQ 
cuituring a suitable host cell comprising a vector according to daim 8 under conditions promoting 

30 expression. 

1 1 . A purified biologically active mammalian TNF receptor (TNF-R) protein. 

12. A purified biotogicalty active soluble human TNF-R protein. 

13. A purified biologically active TNF-R protein according to claim 12, comprising the sequence of amino 
add residues 1-235 of Rgure 2A. - * 

35 14. A purified biologically active TNF-R protein according to claim 12, comprising the sequence of amino 
add residues 1-185 of Rgure 2A. 

15. A purified biologically active TNF-R protein according to claim 12, comprising the sequence of amino 
add residues 1-163 of Rgure 2A. 

16. The use of a mammaJian TNF-R protein in preparing a medicament for regulating immune responses in 
40 mammals. 

17. The method of daim 16. wherein the TNF-R protein Is human TNF-R and the mammal to be treated is a 
human. 

18. The use of mammalian TNF-R protein in preparing a pharmaceutical composition suitable for parenteral 
admlnistratfon to a human patierrt for regulating immune responses. 

4$ 19. A process for detecting TNF or TNF-R molecules or the Interaction thereof, comprising use of a 
mammalian TNF receptor protein, a soluble TNF receptor protein capable of binding TNF or substantially 
similar TNF-R analog produced by recombinant cell culture. 
20. Antitxxfies Immunoreactive with mammalian TNF receptors. 

50 Claims for the following Contracting State: ES 

1. A process for preparing a purified mammalian TNF receptor (TNFR) protein, the process comprising 
coupling together successive amino add residues by the fbnmatlon of peptide bonds to form a TNF-R 
polypeptide. 

55 2. A process according ta claim 1, wherein the TNF-R protein Is a soluble human TNF-R protein. 

3. A process according to claim 2, wherein the soluble TNFR protein has an amino add sequence 
comprising the sequence of amino add residues 1-x of Rgure 2A. wherein x Is selected from the group 
consisting.of amino adds 183-235. 



20 



EP 0 418 014 A1 



4. A process according to claim 3, wherein the soluble TNF-R protein has an amio acid sequence which 
comprises the sequence of amino acid residues 1-235 of Rgure 2A. 

5. A process according to claim 3, wherein the soluble TNF*R protein has an amlo add sequence which 
comprises the sequence of amino acid residues M85 of Rgure 2A. 

5 6. A process according to claim 3. wherein the soluble TNF<R protein has an amio acid sequence which 
comprises the sequence of amino acid residues 1-163 of Rgure 2A. 

7. The use of a mammalian TNF-R protein In preparing a medicament for regulating immune responses in 
mammals. 

8. The use of a mammalian TNF-R protein in preparing a pharmaceutical composition suitable for parenteral 
70 administration to a human patient for regulating Immune responses* 

9. A process for preparing a DNA sequence encoding a mammalian TNF receptor (TNF-R) protein, the 
process comprising coupling together successive nucleotide residues. 

10. A process for preparing a ONA sequence according to claim 9. wherein the DNA sequence encodes a 
soluble human TNF-R protein. 

15 11. A process for preparing a ONA sequence according to claim 10. wherein the ONA sequence encodes a 
soluble TNF-R protein having an amino add sequence comprising the sequence of amino add residues l-x 
of Rgure 2A, wherein x is selected from the group consisting of amino adds 163-235. 

12. A process for preparing a DNA sequence according to claim 10. wherein the DNA sequence encodes a 
sotuble TNF-R protein having an amio add sequence which comprises the sequence of amino acid 

20 residues 1-235 of Rgure 2A. 

13. A process for preparing a ONA sequence according to claim 10. wherein the DNA sequence encodes a 
soluble TNF-R protein having an amio add sequence which comprises the sequence of amino add 
residues 1-185 of Rgure 2A. 

14. A process for preparing a DNA sequence according to claim 10. wherein the DNA sequence encodes a 
2S soluble TNF-R protein having an amio add sequence which comprises the sequence of amino add 

residues M63 of Rgure 2A. 

15. A process for preparing a DNA sequence according to claim 9. said DNA being selected from the group 
consisting of: - - -.^ " . 

(a) cONA clones having a nucleotide sequence derived from the coding region of a native mammalian 
30 TNF-R gene; 

(b) DNA sequences capable of hybridization to the clones of (a) under moderately stringent conditions 
(50* C, 2 X SSC) and which encode biologically active TNF-R protein; and 

(c) DNA sequences which are degenerate as a result of the genetic code to the DNA sequences defined 
in (a) and (b) and which encode biologicaily active TN^JTprotein. 

35 18. A process for preparing a DNA sequence according to claim 9, said DNA encoding a TNF-R protein 
having the sequence of amino adds of the TNF-R protein expressed by pCAV/NOT-TNF-R (ATCC 68088). 

17. A process for preparing a recombinant expression vector, comprising ligating bacterial, yeast or 
mammalian expression vector DNA and a DNA sequence encoding a human TNF-R protein sequence. 

18. A process for preparing a mammalian TNF-R or an analog thereof, comprising cutturing a suitable host 
40 cell comprising a vector prepared according to daim 17 under conditions promoting expression. 

19. A process for detecting TNF or TNF-R protein molecules or the interaction thereof, comprising use of a 
mammalian TNF-R protein, a soluble TNF-R protein capable of binding TNF or substantially similar TNF-R 
analog produced by recombinant cell culture. 

20. A process for the preparation of antibodies immunoreactlve with TNF receptor, the process comprising 
45 either (a) culturing a hybridoma cell expressing the antibodies and harvesting the antibodies, or (b) 

harvesting antibodies Immunoreactlve with TNF receptor from an appropriately Immunised animal. 

Claims for the following Contracting State: GR 

60 1 . An Isolated DNA sequence encoding a bldogicalty active mammalian TNF receptor (TNF-R) protein. 

2. An isolated ONA sequence according to claim 1. selected from the group consisting of: 

(a) cDNA dones having a nudeotide sequence derived from the coding region of a native mammalian 
TNF-R gene; 

(b) DNA sequences capable of hybridization to the clones of (a) under moderately stringent conditions 
55 (50 '.C, 2 X SSC) and which encode blologicalty active TNF-R protein; and 

(c) DNA sequences which are degenerate as a result of the genetic code to the DNA sequences defined 
in (a) and (b) and which encode blologicalty acth/e TNF-R protein. 

3. An Isolated ONA sequence according to daim 1 whteh encodes a soluble human TNF-R protein. 
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4. An isolated DNA sequence according to claim 3, wherein the soluble human TNF-R protein has an amino 
add sequence comprising the sequence of amino add residues 1-x of Rgure 2K wherein x is selected 
from the group consisting of amino acids 163-235 

5. An isolated ONA sequence according to claim 3, wherein the soluble human TNF-R protein comprises 
5 the sequence of amino adds 1-235 of Figure 2A. 

6. An isolated DNA sequence acconding to claim 3. wherein the soluble human TNF-R protein comprises 
the sequence of amino adds 1*185 of Figure 2A. 

7. An isolated DNA sequence according to claim 3. wherein the soluble human TNF-R protein comprises 
the sequence of amino adds 1-163 of Rgure 2A. 

to 8. A DNA sequence according to claim 3. wherein amino add residue 46 Is selected from the group 
consisting of lie and Thr and amino add residue 118 is selected from the group consisting of Val and He. 

9. A recombinant expression vector comprising a DNA sequence according to any one of claims 1-7. 

10. A process for preparing a purified mammafian TNF receptor (TNF-R) protein, the process comprising 
coupling together successive amino add residues by the formation of peptide bonds to form a TNF-R 

75 polypeptide. 

1 1 . A process according to claim 9. wherein the TNF-R protein is a soluble human TNF-R protein. 

12. A process according to claim 11, wherein the soluble human TNF-R protein has an amino acid 
sequence comprising the sequence of amino add residues l-x of Rgure 2A, wherein x is selected from the 
group consisting of amino adds 163*235. 

so 13. A process according to claim 11, wherein the soluble human TNF-R protein has an amio acid sequence 
which comprises the sequence of amino add residues 1 -235 of Rgure 2A. 

14. A process according to claim 11, wherein the soluble human TNF-R protein has an amio add sequence 
which comprises the sequence of amino add residues 1-185 of Rgure 2A 

15. A process according to claim 11, wherein the soluble human TNF-R protein has an amio add sequence 
25 which comprises the sequence of amino add residues 1*163 of Rgure 2A. 

16. Tlie use of a mammalian TNF-R protein in preparing a medicament for regulating immune responses in 
mammals. 

17. The use of a mammalian TNF-R phMn in prep^ng.^harmaceutica] composition suitatrte tor 
parenteral administration to a human patient for regulating Immune responses. 

30 18. Antibodies immunoreactive with mammalian TNF receptors. 



35 
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(2) IKTORMXTION FOR SBQ ID K0:1: 

(i) SEQUEKCS CHARACtSRXSTICS: 

(A) LENGTH: 1641 base paira 

(B) ms: nucleic tcld 

(C) STRAMDSDNESS; single 

(D) tOPOIiOGr: linear 

(ii) MOLECULE TYPE: CONA to mRNA 

<iii) anOTHETXCAL: N 

(iv) AHTI^SENSE: N 

(vi) ORIQIKAL SOURCE: 

(A) ORGANISM: Somo aapiena 

(G) CELL TY?E: Fibroblaat 

(H) CELL LIN^: 9ri-26 VA4 

(vii) IMMEDIATE SOURCE: 

(A> LIBRARY: HI -26 VA4 
<B) ChOm: 1 

<ix) FEATURE: 

(A) NAME/KEY: COS 

<B) LOCATION: 88.. 1473 

(D) OTHER INFORMATION: 
(ix) FEATURE: 

(A) NAME/KEY: mat.peptide 

(B) LOCATION: 88.. 1470 
(D) OTHER INFOBMATIpN: 

(ix) FEATURE: ~ - - 

(A) NAME/KEY: aig^peptide 

(B) LOCATION: 88.. 153 
<0) OTHER INFORMATION: 

(X) PUBLICATION INFORMATION: 

(A) AUTHORS: Saith , Craift A. ^ 

DaviSr Terxi 
Anderson, Dirk 
Solaa, Liaabeth 
Beckaann, M. P. 
Jerry, Rita 
Dower, Steven R. 
Coaman, David 
Goodirin, Raymnd G. 

(B) TITLE: A Receptor for Tusor Necrosis Factor Defines an 

Unusual Family of Cellular and Viral Proteins 

(C) JOURNAL: Science 

(D) VOLUME: 248 

(F) PAGES: 1019-1023 
(6) DATE: 25-MAY-1990 

(xi) SBOUENCS DESOUPTIOM: SEQ ID N0:1: 

GC6&GGCAGG CAGCCTG6AG AGAAGGCGCT GGGCTGCGAG GGCGCGAGGG CGCGAGGGCA 60 

GGGGGCAACC GGACCCCGCC CGCATCC ATO GCG CCC 6TC GCC GTC TGG GCC 111 

Net Ale Pro Val Ala val Trp Ala 
1 5 

GCG CTO GCC GTC GGA CTG GAG CTC TGG GOT GCG GOO CAC GCC TTG CCC 159 
Ala. Lau Ala Val Oly Leu Glu Lea Trp Ala Ala Ala His Ala L«u Pro 
10 IS 20 
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GCC CA6 GTG GCX TXT KCX CCC TAC GCC CCG GAG CCC GGG AGC ACA TGC 207 
AlA Gin Val Ala Ph« thr Pro tyr Ala Pro Glu Pro Gly Sar thr Cya 
25 30 35 40 

CGG CTC AGA GAA TAC TAT GAC CAG ACA GCT CAG ATG TGC TGC AGC AAA 255 
Arg Leu ksq Glu Tyr Tyr Aap Gin Thr Ala Gin Met Cya Cya Ser Lya 
45 50 55 

TGC TCG CCG GGC CAA CAT GCA AAA GTC TTC TGT ACC AAG ACC TCG GAC 303 
Cya Ser Pro Gly Gin Kla Ala Lya Val Phe Cya Thr Lya Thr Ser Aap 
60 65 70 

ACC GTG TGT GAC TCC TGT GAG GAC AGC ACA TAC ACC CAG CTC TGG AAC 351 
Thr Val Cya Aap Ser Cya Glu Aap Ser Thr Tyr Thr Gin Leu Trp Aan 
75 80 85 

TGG GTT CCC GAG TGC TTG AGC TOT GGC TCC CGC TGT AGC TCT GAC CAG 399 
Trp Val Pro Glu Cya Leu Ser Cya Gly Ser Arg Cya Ser Ser Aap Gin 
90 95 100 

GTG GAA ACT CAA GCC TGC ACT COG GAA CAG AAC CGC ATC TGC ACC TGC 447 
Val Glu Thr Gin Ala Cya Thr Arg Glu Gin Aan Arg lie Cya Thr Cya 
105 110 115 120 

AGG CCC GGC TGG TAC TGC GCG CTG AGC AAG CAG GAG GGG TGC CGG CTG 495 
Arg Pro Gly Trp Tyr Cya Ala Leu Ser Lya Gin Glu Gly Cya Arg Leu 
125 130 135 

TGC GCG CCG CTG CGC AAG TGC CGC CCG GGC TTC GGC GTG GCC AGA CCA 543 
Cya Ala Pro Leu Arg Lya Cya Arg Pro Gly Phe Gly Val Ala Arg Pro 
140 '!« - 150 

GGA ACT GAA ACA TCA GAC GTG GTG TGC AAG CO: TGT GCC CCG GGG ACG 591 
Gly Thr Glu Thr Ser Aap Val Val Cya Lya Pro Cya Ala Pro Gly Thr 
155 160 165 

TTC TCC AAC ACG ACT TCA TCC ACG GAT ATT TGg^GG CCC CAC CAG ATC 639 
Phe Ser Aan Thr Thr Ser Ser Thr Aap lie Cya Arg Pro Hia Gin lie 
170 175 180 

TGT AAC GTG GTG GCC ATC CCT GGG AAT GCA AGC ATG GAT GCA GTC TGC 687 
Cya Aan Val Val Ala lie Pro Gly Aan Ala Ser Met Aap Ala Val Cya 
185 190 195 200 

ACG TCC ACG TCC CCC ACC CGG AGT ATG GCC CCA GGG GCA GTA CAC TTA 735 
Thr Ser Thr Ser Pro Thr Arg Ser Met Ala Pro Gly Ala Val Hia Leu 
205 210 215 

CCC CAG CCA GTG TCC ACA CGA TCC CAA CAC ACG CAG CCA ACT CCA GAA 783 
Pro Gin Pro Val Ser Thr Arg Ser Gin Hia Thr Gin Pro Thr Pro Glu 
220 225 230 

CCC AGC ACT GCT CCA AGC ACC TCC TTC CTG CTC CCA ATG GGC CCC AGC 831 
Pro Ser Thr Ale Pro Ser Thr Ser Phe Leu Leu Pro Met Gly Pro Ser 
235 240 245 

CCC CCA GCT GAA GGG AGC ACTGGCGACTTCGCTCTTCCAGTT GGA CTG 879 
Pro Pro Ala Glu Gly Ser Thr Gly Aap Phe Ala Leu Pro Val Gly Leu 
250 255 260 

ATT GTG GGT GTG ACA.GCC TTG GGT CTA CTA ATA ATA GGA GTG GTG AAC 927 
lie Val Gly Val Thr Ala Leu Gly Leu Leu Zle Zle Gly Val Val Aan 
265 270 275 280 
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TGT GTC ATC ATG ACC CAG GTG AAA AAG AAG CCC TTG TGC CTG CAG AGA 975 
Cy3 Val lie Met Thr Gin Val Lya Lys Lya Pro Leu Cya Leu Gin Arg 
285 290 295 

GAA GCC AAG GTG CCT CAC TTG CCT GCC GAT AAG GCC CGG GGT ACA CAG 1023 
Glu Ala Lys Val Pro His Leu Pro Ala Aap Lys Ala Arg Gly Thr Gin 
300 305 310 

GGC CCC GAG CAG CAG CAC CTG CTG ATC ACA GCG CCG AGC TCC AGC AGC 1071 
Gly Pro Glu Gin Gin His Leu Leu Zle Thr Ala Pro Ser Ser Ser Ser 
315 320 325 

AGC TCC CTG GAG AGC TCG GCC AGT GCG TTG GAC AGA AG6 GCG CCC ACT 1119 
Ser Ser Leu Glu Ser Ser Ala Ser Ala Leu Asp Arg Arg Ala Pro Thr 
330 335 340 

CGG AAC CAG CCA CAG GCA CCA GGC GTG GAG GCC AGT GGG GCC GGG GAG 1167 
Arg Asn Gin Pro Gin Ala Pro Gly Val Glu Ala Ser Gly Ala Gly Glu 
345 350 355 360 

GCC CGG GCC AGC ACC GGG AGC TCA GAT TCT TCC CCT GGT GGC CAT GGG 1215 
Ala Arg Ala Ser Thr Gly Ser Ser Asp Ser Ser Pro Gly Gly His Gly 
365 370 375 

ACC CAG GTC AAT GTC ACC TGC ATC GTG AAC GTC TGT AGC AGC TCT GAC 1263 
Thr Gin Val Asa Val Thr Cys lie Val Asn Val Cye Ser Ser Ser Asp 
380 385 390 

CAC AGC TCA CAG TGC TCC TCC CA^ GCC AGC TCC ACA ATG GGA GAC ACA 1311 
His Ser Ser Gin Cys Ser Ser Gin Xla Ser Ser Tb» Met Gly Asp Thr * 
395 400 405 

GAT TCC AGC CCC TCG GAG TCC CCG AAG GAC GAG CAG GTC CCC TTC TCC 1359 
Asp Ser Ser Pro Ser Glu Ser Pro Lys Asp Glu Gin Val Pro Phe Ser 
410 415 420 

AAG GAG GAA TGT GCC TTT CGG TCA CAG CTG GAG ACG CCA GAG ACC CTG 1407 
Lys Glu Glu Cys Ala Phe Arg Ser Gin Leu Glu Thr Pro Glu Thr Leu 
425 430 435 440 

CTG GGG AGC ACC GAA GAG AAG CCC CTG CCC CTT GGA GTG CCT GAT GCT 1455 
Leu Gly Ser Thr Glu Glu Lys Pro Leu Pro Leu Gly Val Pro Asp Ala 
445 450 455 

GGG ATG AAG CCC AGT TAA CCAGGCCGGT GTGGGCTGTG TCGTAGCCAA 1503 
Gly Met Lys Pro Ser 

460 N 

GGTGGGCTGA GCCCTGGCAG GATGACCCTG CGAAGGGGCC CTGGTCCTTC CAGGCCCCCA 1563 

CCACTAGGAC TCTGAGGCTC TTTCTGGGCC AAGTTCCTCT AGTGCCCTCC ACAGCCGCAG 1623 

CCTCCCTCTQ ACCTGCAO 1€41 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SSOOSMCli CHABACTERISTICS: 

(A) LE)R>TH: 462 aaino acids 

(B) fnZi amino acid 
(O) TOPOLOGT: linear 
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(11) HOLECULE TYPE: protein 

(xl) SEQUEKCE DESCRIPTION: SEQ ID N0:2: 

Met Ala Pro Val Ala Val Trp Ala Ala Leu Ala Val Gly Leu Glu Leu 
1 5 10 15 

Trp Ala Ala Ala His Ala Leu Pro Ala Gin Val Ala Phe Thr Pro Tyr 
20 25 30 

Ala Pro Glu Pro Gly Ser Thr Cys Arg Leu Arg Glu Tyr Tyr Aap Gin 
35 40 45 

Thr Ala Gin Met Cya Cya Ser Lys Cya Ser Pro Gly Gin Hia Ala Lya 
50 55 60 

Val Phe Cya Thr Lys Thr Ser Asp Thr Val Cys Asp Ser Cys Glu Asp 
65 70 75 80 

Ser Thr Tyr Thr Gin Leu Trp Asn Trp Val Pro Glu Cys Leu Ser Cys 
85 90 95 

Gly Ser Arg Cys Ser Ser Asp Gin val Glu Thr Gin Ala Cys Thr Arg 
100 105 110 

Glu Gin Asn Arg lie Cys Thr Cys Arg Pro Gly Trp Tyr Cys Ala Leu 
115 120 125 

Ser Lys Gin Glu Gly Cys Arg Leu Cys Ala Pro Leu Arg Lys Cys Arg 
130 135 ^ 14fi. 

Pro Gly Phe Gly Val Ala Arg Pro Gly Thr Glu Thr Ser Asp Val Val 
145 150 155 160 

Cys Lys Pro Cys Ala Pro Gly Thr Phe Ser Asn Thr Thr Ser Ser Thr 
165 170 ^ 175 

Asp lie Cys Arg Pro His Gin lie Cys Asn Val Val Ala lie Pro Gly 
180 185 190 

Asn Ala Ser Met Asp Ala Val Cys Thr Ser Thr Ser Pro Thr Arg Ser 
195 200 205 

Met Ala Pro Gly Ala Val His Leu Pro Gin Pro Val Ser Thr Arg Ser 
210 215 220 

Gin Hia Thr Gin Pro Thr Pro Glu Pro Ser Thr Ala Pro Ser Thr Ser 
225 230 235 240 

Phe Leu Leu Pro Met Gly Pro Ser Pro Pro Ala Glu Gly Ser Thr Gly 
245 250 255 

Asp Phe Ala Leu Pro Val Gly Leu He Val Gly Val Thr Ala Leu Gly 

260 265 270 

Leu Leu He He Gly Val Val Asn Cys Val He Met Thr Gin Val Lys 
275 2S0 285 

Lys Lys Pro Leu Cys. Leu Gin Arg Glu Ala Lys Val Pro His Leu Pro 
290 295 300 
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Ala Asp Lys Ala Arg Gly Thr Gin Gly Pro Glu Gin Gin His Iiau Leu 
305 310 315 320 

He Thr Ala Pro Ser Ser Ser Ser Ser Ser Leu Glu Ser Ser Ala Ser 
325 330 335 

Ala Leu Aap Arg Arg Ala Pro Thr Arg Aan Gin Pro Gin Ala Pro Gly 
340 345 350 

Val Glu Ala Ser Gly Ala Gly Glu Ala Arg Ala Ser Thr Gly Ser Ser 
355 360 365 

Asp Ser Ser Pro Gly Gly Bis Gly Thr Gin Val Asn Val Thr Cys He 
370 375 380 

Val Asn Val Cys Ser Ser Ser Asp His Ser Ser Gin Cys Ser Ser Gin 
385 390 39S 400 

Ala Ser Ser Thr Met Gly Asp Thr Asp Ser Ser Pro Ser Glu Ser Pro 
405 410 415 

Lys Asp Glu Gin Val Pro Phe Ser Lys Glu Glu Cys Ala Phe Arg Ser 
420 425 430 

Gin Leu Glu Thr Pro Glu Thr Leu Leu Gly Ser Thr Qlu Glu Lys Pro 
435 440 445 

Leu Pro Leu Gly Val Pro Asp Ala Gly Met Lys Pro Ser • 

450 455 .^^^ 460 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 3813 base pairs 
(B) TYPE: nucleic acid 
fC) STRANDEDNESS: single 
(D) TOPOLOGy: linear 

(ii) MOLECULE TYPE: cONA to raRNA 

(iii) HYPOTHETICAL: N 

(iv) ANTI-SENSE: N 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: mouse 

(B) STRAIN: C57BL/6 

(G) CELL TYPE: T-helper cell 

(H) CELL LINE: 7B9 

(▼ii) IMMEDIATE SOURCE: 
(B) CLONE: 11 

[Lx) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 55.. 1479 
(0) OTHER INFORMATION: 

{ix) FEATURE: 

(A) NAME/ICSY: inatj>eptide 

(B) LOCATION: 55. .1476 
(D) OTHER INFORIATIC»l: 
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lix) FBATURE: 

(A) NAME/KEY: 3i9_peptida 

(B) LOCAtZOH: 55.. 120 

(D) OTHER INFORMATZON: 



(xi) SEQUEKCE DESCRZPTXOM; SEQ IB K0:3: 
CGCAGCTGAG GCACTAGAGC TCCAGGCACA AGGGCGGGAG CCACCGCTGC CCCT ATG 57 

1 

GCG CCC GCC GCC CTC TGG GTC GCG CTG GTC TTC GAA CTG CAG CTG TGG 105 
Ala Pro Ala Ala lieu Trp Val Ala Lau Val Pha Glu Leu Gin teu Trp 
5 10 15 

GCC ACC GGG CAC ACA GTG CCC GCC CAG GTT GTC TTG ACA CCC TAC AAA 153 
Ala Thr Gly His Thr Val Pro Ala Gin Val Val Leu Thr Pro Tyr Lys 
20 25 30 

CCG GAA CCT GGG TAC GAG TGC CAG ATC TCA CAG GAA TAC TAT GAC AGG 201 
Pro Glu Pro Gly Tyr Glu Cya Gin lie Ser Gin Glu Tyr Tyr Asp Arg 
35 40 45 

AAG GCT CAG ATG TGC TGT GCT AAG TGT CCT CCT GGC CAA TAT GTG AAA 249 
Lys Ala Gin Met Cys Cya Ala Lya Cya Pro Pro Gly Gin Tyr Val Lys 
50 55 ' 60 65 

CAT TTC TGC AAC AAG ACC TCG GAC ACC GTG TGT GCG GAC TGT GAG GCA 297 
His Pha Cys Asn Lys Thr Ser Asp Thr Val Cys Ala Aap Cys Glu Ala 
70 75 80 

AGC ATG TAT ACC CAG GTC TGG AAC CAG TTT CGT ACA TGT TTG AGC TGC 345 
Ser Met Tyr Thr Gin Val Trp Aan Gin Phe Arg Thr Cys Leu Ser Cys 
85 90 95 

AGT TCT TCC TGT ACC ACT GAC CAG GTG GAG ATC CGC GCC TGC ACT AAA 393 
Ser Ser Ser Cys Thr Thr Asp Gin Val Glu lie Arg Ala Cys Thr Lys 
100 105 110 

CAG CAG AAC CGA GTG TGT GCT TGC GAA GCT GGC AGG TAC TGC GCC TTG 441 
Gin Gin Asn Arg Val Cys Ala Cys Glu Ala Gly Arg Tyr Cys Ala Leu 
115 120 125 

AAA ACC CAT TCT GGC AGC TGT CGA CAG TGC ATG AGG CTG AGC AAG TGC 489 
Lys Thr His Ser Gly Sar Cys Arg Gin Cys Met Arg Leu Ser Lys Cys 
130 135 140 145 

GGC CCT GGC TTC GQA GTG GCC AGT TCA AGA GCC CCA AAT GGA AAT GTG 537 
Gly Pro Gly Phe Gly Val Ala Ser Ser Arg Ala Pro Aan Gly Aan Val 
ISO 155 160 

CTA TGC AAG GCC TGT GCC CCA GGG ACG TTC TCT GAC ACC ACA TCA TCC 585 
Lau Cys Lys Ala Cys Ala Pro Gly Thr Phe Ser Aap Thr Thr Ser Ser 
165 170 175 

ACT GAT GTG TGC AGG CCC CAC CGC ATC TGT AGC ATC CTG GCT ATT CCC 633 
Thr Asp Val Cya Arg Pro Bis Arg Zle Cys Ser lie Leu Ala Zle Pro 
180 185 190 
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GGA AAT GCA AGC ACA GAT GCA GTC TGT GCG CCC GAG TCC CXA ACT CTA 681 
Gly Asn Ala Ser Thr Aap Ala Val Cys Ala Pro Glu Ser Pro Thr Lau 
195 200 205 

AGT GCC ATC CCA AGG ACA CTC TAC GTA TCT CAG CCA GAG CCC ACA AGA 729 
Sar Ala 11a Pro Arg Thr Lau Tyr Val Sar Gin Pro Glu Pro Thr Aro 
210 215 220 225 

TCC CAA CCC CTG GAT CAA GAG CCA GGG CCC AGC CAA ACT CCA AGC ATC 777 
Sar Gin Pro Lau Asp Gin Glu Pro Gly Pro Sar Gin Thr Pro Sar lie 
230 235 240 

CTT ACA TCG TTG GGT TCA ACC CCC ATT ATT GAA CAA AGT ACC AAG GGT 825 
Lau Thr Sar Lau Gly Sar Thr Pro Zla Ila Glu Gin Sar Thr Lys Gly 
245 250 255 

GGC ATC TCT CTT CCA ATT GGT CTG ATT 6TT GGA GTG ACA TCA CTG GGT 873 
Gly Zla Sar Lau Pro Zla Gly Lau Zla Val Gly Val Thr Sar Lau Gly 
260 265 270 

CTG CTG ATG TTA GGA CTG GTG AAC TGC ATC ATC CTG GTG CAG AGG AAA 921 
Lau Lau Mat Lau Gly Lau Val Aan Cys Zla Zla Lau Val Gin Aro Lya 
275 280 285 

AAG AAG CCC TCC TGC CTA CAA AGA GAT GCC AAG GTG CCT CAT GTG CCT 969 
Lys Lys Pro Sar Cys Lau Gin Arg Asp Ala Lys Val Pro His Val Pro 
290 295 300 305 

GAT GAG AAA TCC CAG GAT GCA GTA^«C CTT GAG CAG CAG CAC CTG TTG 1017 
Asp Glu Lys Sar Gin Asp Ala Val (a.y Lau Glu Glxf^ln His Lau Lau 
310 315 320 

ACC ACA GCA CCC AGT TCC AGC AGC AGC TCC CTA GAG AGC TCA GCC AGC 1065 
Thr Thr Ala Pro Sar Sar Sar Sar Sar Sar Lau Glu Sar Sar Ala Sar 
325 330 335 

GCT GGG GAC CGA AGG GCG CCC CCT GGG GGC CAT CCC CAA GCA AGA GTC 1113 
Ala Gly Asp Arg Arg Ala Pro Pro Gly Gly His Pro Gin Ala Arg Val 
340 345 350 

ATG GCG GAG GCC CAA GGG TTT CAG GAG GCC CGT GCC AGC TCC AGG ATT 1161 
Met Ala Glu Ala Gin Gly Phe Gin Glu Ala Arg Ala Ser Ser Arg Zle 
355 360 365 

TCA GAT TCT TCC CAC GGA AGC CAC GGG ACC CAC GTC AAC GTC ACC TGC 1209 
Ser Asp Ser Sec His Gly Ser His Gly Thr His Val Asn Val Thr Cys 
370 375 380 385 

ATC GTG AAC GTC TGTAGCAGCTCTGACCACAGTTCTCAGTGCTCTTCC 1257 
Zle Val Asn Val Cys Ser Ser Ser Asp His Ser Ser Gin Cys Ser Ser 
390 395 400 

CAA GCC AGC GCC ACA GTG GGA GAC CCAGAT6CCAAGCCCTCAGCGTCC 1305 
Gin Ala Ser Ala Thr Val Gly Asp Pro Asp Ala Lys Pro Ser Ala Ser 
405 410 415 

CCA AAG GAT GAS CAG. GTC CCC TTC TCT CAG GAG GAG TGT CC6 TCT CAG 1353 
Pro Lys Asp Glu Gin Val Pro Phe Ser Gin Glu Glu Cya Pro Ser Gin 
420 425 430 
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TCC CC6 TGT GAG ACT ACA GAG ACA CTG CAG AGC CAT GAG AAG CCC TTG 1401 
Ser Pro Cys Glu Thr Thr Glu Thr Lau Gin Ser His Glu Lys Pro Lau 
435 440 445 

CCC CTT GGT GTG CCG GAT ATG GGC ATG AAG CCC AGC CAA GCT GGC TGG 1449 
Pro Lau Gly Val Pro Asp Hat Gly Mat tya Pro Sar Gin Ala Gly Trp 
450 455 460 465 

TTT GAT CAG ATT GCA GTC AAA GTG GCC TGA CCCCTGACAG GGGTAACACC 1499 
Phe Asp Gin He Ala Val Lya Val Ala 

470 475 

CTGCAAAGGG ACCCCCGAGA CCCTGAACCC ATGGAACTTC ATGACTTTTG CTGGATCCAT 1559 

TTCCCTTAGT GGCTTCCAGA GCCCCAGTTG CAGGTCAAGT GAGGGCTGAG ACAGCTAGAG 1619 

TGGTCAAAAA CTGCCATGGT GTTTTATGGG GGCAGTCCCA GQAAGTTGTT GCTCTTCCAT 1679 

GACCCCTCTG GAICTCCTGQ QCTCTTGCCT GATTCTTGCT TCTGAOAGGC CCCAGTATTT 1739 

TTTCCTTCTA AGGAGCTAAC ATC C T C TTCC ATGAATAGCA CAGCTCTTCA GCCTGAATGC 1799 

TGACACTGCA GGGCGGTTCC AGCAAGTAGG AGCAAGTGGT GGCCTGGTAG GGCACAGAGG 1859 

CCCTTCAGGT TAGTGCTAAA CTCTTAGGAA GTACCCTCTC CAAGCCCACC GAAATTCTTT 1919 

TGATGCAAQA ATCAGAG6CC CCATCAGGCA GAGTTGCTCT GTTATAGGAT GGTAGGGCTG 1979 

TAACTCAGTG GTCCAGTGTG CTTTTAGCAT GCCCTGGGTT TGATCCTCAG CAACACATGC 2039 

AAAACGTAAG TAGACAGCAG ACAGCAGACA^(%ACAGCCAG -CCCCC TG TGT GGTTTGCAGC 2099 

CTCTGCCTTT GACTTTTACT CTGGTGGGCA CACAGAGGGC TGGAGCTCCT CCTCCTGACC 2159 

TTCTAATGAG CCCTTCCAAG GCCACGCCTT CCTTCAGGGA ATCTCAGGGA CTGTAGAGTT 2219 

CCCAGGCCCC TGCAGCCACC TGTCTCTTCC TACCTCAGCfr^GGAGCACTC CCTCTAACTC 2279 

CCCAACGGCT TGGTACTGTA CTTGCTGTGA CCCCAACGTG CATTGTCCGG GTTAGGCACT 2339 

GTGAGTTG6A ACAGCTCATG ACATCGGTTG AAAGGCCCAC CCGGAAACAG CTAAGCCAGC 2399 

TCTTTTGCCA AAGGATTCAT GCCGGTTTTC TAATCAACCT GCTCCCTAGC ATTGCCTGGA 2459 

AGGAAAGGGT TCAGGA6ACT CCTCAAGAAG CAAGTTCAGT CTCAGGTGCT TGGATGCCAT 2519 

GCTCACCGAT TCCACTGGAT ATGAACTTGG CAGAGGAGCC TAGTTGTTGC CATGGAGACT 2579 

TAAA6AGCTC AGCACTCTGG AATCAAGATA CTGGACACTT GGGGCCGACT TGTTAAGGCT 2639 

CTGCAGCATC AGACTGTAGA GGGGAAGGAA CACGTCTGCC CCCTGGTGGC CCGTCCTGGG 2699 

ATGACCTCGQ GCCTCCTAGG CAACAAAAGA ATGAATTGGA AAGGATGTTC CT GGG T G TGG 2759 

CCTAGCTCCT G T GC TTGTGT GQATCCCTAA AGGGTGTGCT AAGGAGCAAT TGCACTGTGT 2819 

GCTGGACAGA ATTCCTGCTT ATAAATGCTT TTTGTTGTTG TTTT G T A CAC TGAGCCCTGG 2879 

CTGAOXACC CCACCCCACC TCCCATCCCA CCTTTACACG CCACTCTTGC ATGAQAACCT 2939 

GGCTGTCTCC CACTTGTAGC CTGTGGATGC TGAGGAAACA CCCAGCCAAG TAGACTCCAG 2999 

GCTTGCCCCT ATCTCCTQCT ATGAGTCTGG CCTCCTCATT GTGTTGTGGG AAGQAGACGQ 3059 
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GTTCTGTCAT CTCGGAACGC CCACACCGTG GATGTGAACA ATGGCTGTAC TAGCTTAGAC 3119 

CAGCTTAGGG CTCTGCATAT CACAGGAGGG GGAGCAGGQA ACAATTTGAG TGCTGACCTA 3179 

TAACACAGTT CCTAAAGGAT CGGGCAGTCC AGAATCTCCT CCTTCAGTGT GTGTGTGTGT 3239 

GTGTGTGTGT GTGTGTGTGT GTGTGTGTGT CCATGTTTGC ATGTATGTGT GTGCCAGTGT 3299 

GTGGAGGCCC GAGGTTGGCT TTGGGTGTGT TTGATCACTC TCCAGTTACT GAGGCGGGCT 3359 

CTCATCTGTA CCCAGAGCTT GCACATTTTC TAGTCTAACT TGATTCAGGG ATCTCTGTCT 3419 

GCCTATGGAG GTGCTCAGGT TACAGGCAGO CTGCCATACC TGCCCGACAT TTACATGAAT 3479 

ACTAGAGATC TGAATTCTGG TCCTCACACT TGTATACCTG CATTTTATCC ACTAAGACAT 3539 

CTCTCCAAGG GCTCCCCCTT CCTATTTAAT AAOTTAGTTT TGAACTGGCA AGATGGCTCA 3599 

GTGGGTAAGG CAGTTTGCGG ACAAACCTGA TGACCTGAGT TGGATCCCTG ACCATAAGGT 3659 

AGAAGAGACC TGATTCCTGC AAGTTGTCCT CTQACCACCA CCCCATACAT GCTTCTGCAT 3719 

ATGTGCACAC ATCACATTCT TGCACACACA CTCACATACC ATAAATGTAA TAAATTTTTT 3779 

TAAATAAATT GATTTTATCT TTTAAAAAAA AAAA 39x3 
(2) XN70BNATZ0N FOR SEQ ZD NO: 4: 

<i) SCQUCHCE CHARACTEiaSTICS: 

(A) LSKGTH: 475 fidao acida ^ . ^ 

(B) TYPE: aaino acid ^ 
(D) TOPOLOGY: linMr 

(ii) MOLECOLE TYPE: protain 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Mat Ala Pro Ala Ala Leu Trp Val Ala Leu Val Phe Glu Leu Gin Leu 
15 10 15 

Trp Ala Tbr Gly His Thr Val Pro Ala Gin Val Val Leu Thr Pro Tyr 
20 25 30 

Lys Pro Glu Pro Gly Tyr Glu Cys Gin lie Ser Gin Glu Tyr Tyr Asp 
35 40 45 

Arg Lye Ala Gin Met Cya Cye Ala Lya Cya Pro Pro Gly Gin Tyr Val 
50 55 60 

Lya Hie Phe Cys Aen Lya Tbr Ser Aap Thr Val Cya Ala Asp Cva Glu 
6S 70 75 80 

Ala Ser Met Tyr Thr Gin Val Trp Aan Gin Phe Arg Thr Cys Leu Ser 
85 90 95 

Cys Ser Ser Ser Cys Thr Thr Asp Gin Val Glu He Arg Ala Cys Thr 
100 105 110 

Lys Gin Gin Asn Arg Val Cys Ala Cys Glu Ala Gly Arg Tyr Cvs Ala 
115 120 125 
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Leu Lya Thx His Ser Gly Sex Cys Azg Gin Cys Met Arg Leu S«r Lya 
130 135 140 



Cys Gly Pro Gly Phe Gly Val Ala Ser Ser Arg Ala Pro Asn Gly Aan 
145 ISO 155 160 

Val Leu Cys Lys Ala Cys Ala Pro Gly Thr Phe Ser Asp Thr Thr Ser 
165 170 175 

Ser Thr Asp Val Cys Arg Pro His Arg Zle Cys Ser lie Leu Ala He 
180 185 190 

Pro Gly Asn Ala Ser Thr Asp Ala Val Cys Ala Pro Glu Ser Pro Thr 
195 200 205 

Leu Ser Ala Zle Pro Arg Thr Leu Tyx Val Ser Gin Pro Glu Pro Thr 
210 215 220 

Arg Ser Gin Pre Leu Asp Gin Glu Pro Gly Pro Ser Gin Thr Pro Ser 
225 230 235 240 

Zle Leu Thr Ser Leu Gly Ser Thr Pro Zle Zle Glu Gin Ser Thr Lys 
245 250 255 

Gly Gly Zle Ser Leu Pro Zle Gly Leu Zle Val Gly Val Thr Ser Leu 
260 265 270 



Gly Leu Leu Met Leu Gly Leu val Asn Cys Zle Zle Leu val Gin Arg 



275 



280. 



283 



Lya Lys Lys Pro Ser Cys Leu Gin Arg Asp Ala Lys Val Pre His Val 
290 295 300 

Pro Aap Glu Lya Ser Gin Asp Ala Val Gly Leu Glu Gin Gin His Leu 
305 310 315 320 

Leu Thr Thr Ala Pro Ser Ser Ser Ser Ser Ser I<eu Glu Ser Ser Ala 

325 330 335 

Ser Ala Gly Asp Arg Arg Ala Pro Pro Gly Gly Bis Pro Gin Ala Arg 
340 345 350 

Val Met Ala Glu Ala Gin Gly Phe Gin Glu Ala Arg Ala Ser Ser Arg 
355 360 365 

Zle Ser Aap Ser Ser His Gly Ser His Gly Thr His Val Asn Val Thr 
370 375 380 

Cys lie Val Asn Val Cys Ser Ser Ser Asp His Ser Ser Gin Cys Ser 

385 390 395 400 

Ser Gin Ale Ser Ala Thx Val Gly Asp Pro Aap Ala Lys Pro Ser Ala 
405 410 415 

Ser Pro Lya Asp Glu Gin Val Pro Phe Ser Gin Glu Glu Cys Pro Ser 
420 425 430 

Gin Ser Pro Cys Glu Thx Thr Glu Thz Leu Gin Ser His Glu Lys Pro 
. 43S 440 445 



Leu Pro Leu Gly Val Pro Asp Met Gly Met Lys Pro Ser Gin Ala Gly 
450 455 460 
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Txp Pha Aap Gin Xle Xla Val Lys Val Ala . 
465 470 47S 
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Ki4 I 



BmTKF-S 



BaTKP.BAlfS 



HaTKF-BAi63 



MqIMF-B 
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nggRi ?H' 

GCG^GGCAGGCAGCCTGGAGAGAAGGCG 28 

CTGGGCTGCGXGGGCGCGAGGGCGCQAa^X&GGGGGCAACCGGXCCCCK 87 

ATG GCQ CCC ore GCC CTC TGG GCC GCO CTO GCC GTC GGA CTG GAG 132 

M«t Al« Pro Val Al« Val Trp Ala Ala Lau Ala Val Gly Lau Glu -8 

CTC TGO GCT GC6 GCG CACGCCTTGCCCGCCCAGGTOGCATTT ACA 177 

Lau Tzp Ala Ala Ala Hia Ala I^i Pro Ala Ola Val Ala Pha Thr 8 

CCC TAG GCC CCG GAG CCC GGG AGC ACA TGC CGG CTC AGA GAA TAC 222 

Pro Tyr Ala Pro Glu Pro Gly 8ar Thr Cya Arg Lau Arg Glu Tyr 23 

TAT QAC CAO ACA GCT CAG ATQ TGC TGC AGC AAA TGC TCQ CCG GGC 267 

Tyr Aap Gin Thr Ala Gin Mat Qya Cya Sar Lya Cya 8ar Pro Gly 38 

CAA CAT GCA AAA GTC TTC TOT ACC AAG ACC TCG GAC ACC GTG T6T 312 

Gin Hia Ala Lya Val Pha Cya Thr Lya Thr Sar Aap Thr Val Cya 53 

GAC TCC TGT GAG GAC AGC ACA TAC ACC CAG CTC TGG AAC TGG GTT 357 

Aap Sar Cya Glu Aap Sar Thr Tyr Thr Gin Lau Trp Aan Trp Val 68 

CCC GAG TGC TT6 AGC TGT GGC TCC CGC TGT AGC TCT GAC CAG GTG 402 

Pro Glu Cya Lau Sar Cya Gly Sar Arg Cya Sar Sar Aap Gin Val 83 

GAA ACT CAA GCC TGC ACT CGG GAA CAO AAC CGC ATC TGC ACC TGC 447 

Glu Thr Gin Ala Cya Thr Arg dlu Gin Aan Arg -ZAa cys thr Cya 98 

AGG Crc GGC TGG TAC TGC GCG CTG AGC AAG CAG GAG GGG TGC CGG 492 

Arg Pro Gly Trp Tyr Cya Ala Lau Sar Lya Gin Glu Gly Cya Arg 113 

CTG TGC GCG CCG CTG CGC AAG TGC CGC CCG GGC TTC GGC GTG GCC 537 

Lau Cya Ala Pro Lau Arg Lya Cya Arg ProJUy Pha Gly Val Ala 128 

AGA CCA GGA ACT GAA ACA TCA GAC GTG GTG TGC AAG CCC TGT GCC 582 

Arg Pro Gly Thr Glu Thr Sar Aap Val Val Cya Lya Pro Cya Ala 143 

CCG GGG ACG TTC TCC AAC ACG ACT TCA TCC ACG GAT ATT TGC AGG 627 

Pro Gly Thr Pha Sar Aan Thr Thr Sar Sar Thr Aap Zla Cya Arg 158 

CCC CAC CAG ATC TGT AAC GTG GTG GCC ATC CCT GGG AAT GCA AGC 672 

Pro Hia Gin Zla Cya ^Aan Val Val Ala Zla Pro Gly Aan Ala Sar 173 

ATG GATGCAGTCTGCACGTCCACGTCCCCCACCCGGAGTATGGCC 717 

Mat Aap Ala val Cya Thr Sar Thr Sar Pro Thr Arg^Sar Hat Ala 188 

CCA GGG GCA GZA CAC TTA CCC CAO CCA GTG TCC ACA CGA TCC CAA 7$2 

Pro Gly Ala Val Hia Lau Pro Gin Pro Val Sar Thr Arg Sar Gin 203 

CACACGCAOCCAACTOCAGAACOCAOCACTGCTCCAAGCACCTCC 807 

Hia Thr Gin Pro Thr Pro Glu Pro Sar Thr Ala Pro Sar Thr Sar 218 

TTC CTG CTC CCA ATG GGC CCC AGCCOCCCAGCTGAAGGGAGCACT 852 

Pha Lau Lau Pro Mat Gly Pro Sar Pro Pro Ala Glu Gly Sar Thr 233 

GGC QAC TTC GCT CTT CCA GTT GGA CTG ATT GTG GOT GIG ACA GCC 897 

Gly Aap Phil A\M Li»ti Pro Val Qlv t^ti rim Vm\ Gly Vm\ Th^ %\m 248 
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TTG GOT CTA CXA AtA ATA GGA GTQ GX6 AAC TGT GTC ATC ATQ ACC 942 

l^U GIV T^u LftU Tl#> Tla filv Vitl Val A.^n Py^ Val Tl^ u^f ^hr 263 

CA6 GTG AAA AAG MG CCC TT6 TGC CTO CAG AQA GAA GCC AAQ GTG 987 

Sin Yftl Lys Lya Lya Pro Leu Cys Lra Gin Arg Glu Ala Lys Val 278 

CCT CAC TTG CCT GCC GAT AAG GCC COG GGT ACA CAG GGC CCC GAG 1032 

Pro His Ii«tt Pro Ala Aap Lya Ala Ar? Gly Thr Gin Gly Pro Glu 293 

CAO CAG CAC CTG CTG ATC ACA GCO CCG AGC TCC AGC AGC AGC TCC 1077 

Gin Gin Hia Lau Ma Xla Thr Ala Pro Sar Ser Sar Sar Ser Sar 308 

CTG GAG AGC TCG GCC AGT GCG TTG QAC AGA AGO GCG CCC ACT CGG 1122 

Lau Glu Sar Sar Ala Sar Ala Lau Aap Arg Arg Ala Pro Thr Arg 323 

AAC CAG CCA CAG GCA CCA GGC GTQ QKG GCC AGT GGG GCC GGG GAG 1167 

Asn Gin Pro Gin Ala Pro Gly Val Glu Ala Sar Gly Ala Gly Glu 338 

GCC CGG GCC AGC ACC GGG AGC TCA GAT TCT TCC CCT GGT GGC CAT 1212 

Ala Arg Ala Sar Thr Gly Sar Sar Aap Sar Sar Pro Gly Gly Kia 353 

GGG ACC CAG GTC AAT GTC ACC TGC ATC GTG AAC GTC TGT AGC AGC 1257 

Gly Thr Gin Val Aan Val Thr Cya Zla Val Asn Val Cys Sar Sar 368 

TCT GAC CAC AGC TCA CAG TGC TCC TCC CAA GCC AGC TCC ACA ATG 1302 

Sar Aap His Sar Sar Gin Cys Sj|^ Sar Gin Ala Sar Sar Thr Mat 383 _ 

GGA GAC ACA GAT TCC AGC CCC TCG GAG TCC CCG AAG GAC GAG CAG 1347 

Gly Asp Thr Asp Sar Sar Pro Sar Glu Sar Pro Lys Asp Glu Gin 398 

GTC CCC TTC TCC AAG GAG GAA TGT GCC TTT CGG TCA CAG CTG GAG 1392 

Val Pro Pha Sar Lys Glu Glu Cys Ala Pha Arg Sar Gin Leu Glu 413 

ACG CCA GAG ACC CTG CTG GGG AGC ACC GAA GAG AAG CCC CTG CCC 1437 

Thr Pro Glu Thr Lau Lau Gly Sar Thr Glu Glu Lys Pro Lau Pro 428 

CTT GGA GTG CCT GAT GCT GGG ATG AAG CCC AGT 1470 

Lau Gly Val Pro Asp Ala Gly Mat Lys Pro Sar 439 



TAACCAGGCCG6TGTGGGCTGTGTCGTAGCCAAGGTGGGCTGAGCCCTGGCAGGATGAC 



GGQCCAAGTTCCTCTAGTGCCCTCCACAGCCGCAGCCTCCCTCTGACCTGCAG. . . 
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rxnn M 

CGCAGCTGAGGCACTAGA6CTCC 23 

AGGCACAA6G6CGGGAGCCACCGCTGCCCCT AT6 GC6 CCC GCC GCC CTC TGG 75 

M*t Ala Pre Ala Ala Lau Trp -16 

GTC GC6 CTO GTC TTC GAA CTO CAS CTG TGG GCC ACC GGG CAC ACA 120 

Val Ala L»\i val Ph« Olu Uu 61a Lau Tip Ala Thr Gly His Thr >l 

GTO CCC GCC CAG GTT GTC WG ACA CCC tAC AAA CCG GJUi OCT GGG 165 

HaL Pro Ala Gla Val Val L«u Thr Pro Tyr Lya Pro Glu Pro Gly 15 

TAC GAG TGC CAG ATC TCA CAG GAA lAC TAT 6AC AGG AAG GCI GAG 210 

Tyr Glu Cys Gin Zla Sar Gin Glu lyr Tyr Asp Axg Z,ya Ala Gin 30 

ATO TGC TGT GCI AAG TGT CCT CCT GGC CAA TAT GTG AAA CAT TTC 255 

Hat Cys Cya Ala Lys Cya Pro Pro Gly Gin Tyr Val Lya Hia Pha 45 

TGC AAC AAG ACC TC6 GAC ACC GTG TGT GCG GAC TGT GAG GCA AGC 300 

Cya Aan Lya Thr Sar Aap Thr Val^ Cya Ala Aap Cya Glu Ala Sar 60 

ATG TAT ACC CAG GTC TGG AAC CAG TTT CGT ACA TGT TTG AGC TGC 345 

Mat Tyr Thr Gin Val Trp Aaa Gin Pha Arg Thr Cys Uu Sar Cys 75 

A6T TCT TCC TGT ACC ACT GAC CAG GTG GAG ATC C6C GCC TGC ACT 390 

Sar Sar Sar Cya Thr Thr Aap Gin Val Glu Ila Arg Ala Cys Thr 90 

AAA CAG CAG AAC C6A GTG TGT GCT TGC GAA GCT GGC A6G TAC TgC 435 

Lya Gin Gin Aan Arg Val Cys xla Cya Glu -Ala ^ly Arg Tyr Cys 105 

GCC TTG AAA ACC CAT TCT GGC AGC TOT CGA CAG TGC ATG AGG CTG 480 

Ala Lau Lys Thr His Sar Gly Sar Cya Arg Gin Cys Mat Arg Lau 120 

AGC AAG TGC GGC CCT GGC TTC GGA GTG GCC AGT TCA AGA GCC CCA 525 

Sar Lya Cys Gly Pro Gly Pha Gly Val. Al^jSar Sar Arg Ala Pro 135 

AAT GGA AAI GTG CTA TGC AAG GCC TGT GCC CCA GGG ACG TTC TCT 570 

Aan Gly Aan Val Lau Cya Lys Ala Cya Ala Pro Gly Thr Pha Sar 150 

GAC ACC ACA TCA TCC ACT GAT GTG TGC AGG OCC CAC CGC ATC TGT 615 

Asp Thr Thr Sar Sar Thr Asp Val Cya Arg Pro Hia Arg Zla Cya 165 

AGC ATC CTG GCT ATT CCC GGA AAT GCA AGC ACA GAT GCA GTC TGT 660 

Sar Zla Lau Ala Zla Pro Gly Aan Ala Sar Thr Asp Ala Val Cys 180 

GCG CCC GAG TCC CCA ACT CTA AGT GCC ATC CCA AGG ACA CTC TAC 705 

Ala Pre Glu Sar Pre Thr Lau Sar Ala Zla Pro Arg Thx Lau Tyr 195 

GTA TCT CAG CCA GAG CCC ACA AGA TCC CAA CCC CTG GAT CAA GAG 750 

Val Sar Gin Pro Glu Pre Thr Arg Sar Gin Pto Lau Aap Gin Glu 210 

CCA GOO CCC AOC CAA ACT CCA AGC ATC CTT ACA TCO TTO G6T TCA 795 

Pre Gly Pre Sar Gin Thr Pre Sar Zla Lau Thr Sar Lau Gly Sac 225 

ACC CCC ATT ATT GAA CAA AGT AOC AAG GGT GGC ATC TCT CTT CCA 840 

Thx Pre Zla Ila Glu Gin Sar Thr tM» ti\y aiv r\m smr t»ti 9^t% 240 

ATT GGT CTO ATT GTT GGA GTG ACA TCA CTG GOT CTO CTO ATO TTA 685 

Ila filv t^u Tli> Viil aw v»\ ThT am^ t^i, niy t^„ t^^ 255 



37 



EP 0 418 014 A1 



GGA CTO GTG AAC TGC ATC ATC CTG 6T6 CA6 AGG AAA AA6 AAG CCC 930 

fliv T^u v«l A»fi rv« Till Tim t^ii v«i Lys lys Ly« »ro 270 

TCC T6C CTA CAA AOA OAT GCC AA6 GTO CCT CAt 610 CCT GAT GAG 975 

S«r Cya Uu Ola Axg A«p Ala ty V«l Pre Bis V«l Pre Asp Olu 285 

AAA ICC CAO OAT OCA OTA OOC CTT OAO CAS CAO CAC CTQ TtO ACC 1020 

tys Smz Ola Asp Als Vsl Oly Utt Olu Ola Ola Bis Uu Leu Thr 300 

ACA OCA CCC AGt TCC ACC AGC AOe tec CTA GAO A6C TCA GCC AGC 1065 

Thr Ala Pre 8«r S«r S«c Sar Sar Sar Uu Olu Sar Sar Ala Sar 315 

GCTGOOQACCQAAGGGCecCCCCTOOOeOCCArCCCCAAeCAAGA 1110 

Ala Oly Asp Axg Arg Ala Pre Pre Oly Oly His Pre Ola Ala Arg 330 

GICAT0GC8QAGGCCCAA0GGTTTCA0GA0GCCC6T0CCAGCTCC 1155 

val Mat Ala Olu Ala Ola Gly Pha Ola Olu Ala Arg Ala Sar Sar 345 

AGO ATT TCA OAT TCT TCC CAC OGA ASC CAC 000 ACC CAC ore AAC 1200 

Arg Zla Sar Asp Sar Sar ais Oly Sar His Oly Thr ais Val Asa 360 

OTC ACC TOC ATC OTO AAC OTC TOT AGC AGC TCT OAC CAC AOT TCT 1245 

Val Thr Cys Zla Val Asa Val Cys Sar Sar Sar Asp Rls Sar Sar 375 

CAOTGCTCTTCeCAAGCCAGCGCCACAGTGGGAGACCCAGATGCC 1290 

Ola Cy* Sar Sar Ola Ala Sar Ala Thr Val Oly Asp Pre Asp Ala 390 

AAGCCCTCAOeGTCCCCAAAGGATGASCAGOTCCCCTTCTCTCAG 1335 

tys Pre Sar Ala Sar Pre Lys Asiv Qlu Ola Val Pre Pha Sar Gla 405 

GAG GAG TCT CCG TCT CAG TCC CC6 T6T GAG ACT ACA GAG ACA CTG 1380 

Glu Glu Cy» Pro S«r Gin Sar Pro Cya Giu Thr Thr Glu Thr L«u 420 

CAG AGC CAT GAG AAG CCC TTG CCC CTT GGT GTG CCG GAT ATG GGC X425 

Gin S«r His Glu Lys Pro Leu Pro Lou Gly Val Pro Aap Met Gly 435 

ATG AAG CCC AGC CAA GCT GGC TOO TTT GAT CAG ATT 6CA GTC AAA 1470 

Met Lya Pro d«r Gin Ala Gly Trp Pho Aap Oln Zla Ala Val Lya 450 



GTG GCC 
val Ala 



1476 
452 



TGACCCC TGACAG GGGTAACACCCTQCAAAGOGACCCCCGAGACCCTGAACCCATGGAAC 1536 

TTCATGACTTTTGCTOGATCCATTTCCCTTAGTOGCTTCCAQAGCCCCAOTTGCAGGTCA 1596 

AGTQAGOGCTGAGACAGCTAGAGTGOTCAAAAACTGCCATGGTGTTTTATGGGGGCAGTC 1656 

C CAGGAAQ TTGTTGCTCTtCCATGACCCCTCTGGATCTCCTGGGCTCTTGCCTGATTCTT 1716 

GCTT CT»Qy K;CCCCAGTATTTTr TC CTTCtAAGQAGCTAACATCCTCTTC C ATGAATA 1776 

GCACAOCTCTTCAGCCTGAATGCTGACACTGCAGGGCGGTTCCAGCAAGtAGQAGCAAGT 1836 

G G T GGC CT GG T A OGGCACAGAGGCCCTTCAGGTTAQTGCTAAACTCTTAGGAAGTACCCt 1896 

CTrr A AG CCC A CCGAAATTCTTTTGATGCAAGAATCAGAGGCCCCATCAGGCAgAQTTqC 1956 

T CTOT TAtAGQATGGTAGGGCTOTAACTCAgTGGTCCA ^ T G T GC I I I ' XA GCATGCCCTQG 2016 

GTTTGATOCTCAGCAACACATGCAAAACGIAAQTAGACAGCAQACAGCAGACAGCACAGC 2076 

CA^CWT Oro i Wn 2136 

QG^TCT CAGOGACTGTAGAOTTCCCAGGCCCCTGCAGCCA CC T G TCTCrTCCTlCCTCA 2256 

GCCTOGAGCAcTCCCTCTAACTCCCCAACGgCTTOGTACTOTACTTGCTOTGACCOCAAC 2316 

GTGCATTGTCCOOOTTAGOCACTOTOAGTTGGAACAGCTcATGACATCGGTTGAAAGGCC 2376 

CACCCGQAAACAGCTAAGCCA GC I Cli 1 XU CCAAAGGATTCATG CCOG l'tl TC l' A ATCAA 2436 

CCTGCTCCCTAGCATT0CCTGGAAGGAAAGGGTTCAGGAQACTCCTCAAGAAGCAA6TTC 2496 

AGTCTCAGGTGCTTGGATGCCATGCTCACCGATTCCACTGGATATGAACTTGGCAGAGQA 2556 
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GCCTAGTT6TTGCCATGGXGXCTTAAA6AGCTCA6CACTCTGGXATCAAGATACTGGACA 2616 

CTT<»GGCCGACTTGtTAAGGCTCtGCAGCATCAGACX6TAGAGGG6MG^ 2676 

GCCCCCtGGTGGCCCGTCCTGGQAtGACCTCGGGCCtCCTAGGCAACAXAAGAAT^ 2736 

GGAAAGGATGTTCCTGGGtGTGGCCTAGCTCCTGTGCTTGTGTGGATCC 2796 

GCTAAGGAGCAAtTGCACTGTGTGCTGGACAGAATTCCTGCTTATAAATGCTTm 2856 

TT GITTI STACACTGAGCCCT»CTG A GCCACCCC A CCCCACCTCCCAT^^ 2916 

ACQCCACtCTtGCAtGAGAACCTGGCTGTCTCCCACTTGTAGCCTGtGGAt^^ 2976 

ACACCai6CCAAGtAQACTCCAGGCTTgCCCCTATCTCCTGcT«TGACTeTg9CCT^^ 3036 

At TgXGTTGTGGQAAgGAGACGGGtTCTQTCATCTCGQAAegCCCACACCGTGtt 3096 

ACA«TGGCTGtACTAGCTTAGACCA9CTTAG6GCTCT6CATATCACAa 3156 

GGAACAAnTQAGTGCTGACCTATAACACAgTTCCTAAAGQAtCGGGCAGhr^ 321 6 

CCTCCTTCAGTGTgTGTOTGTGTG TG TGT G TGTGTGT G TGTGTGTGTCTQTGtC^ 327 6 

TGCATGtAtGTGTGTGCCAgTCTGTGGACWCCCGAGGTTGGCTTT GW 3336 

CTCTCCAGTTACTGM^GCGGGCTCraiTCTGTACCCAGAGCtTGCAC^ 3396 

ACTTGATTCAGGCaTCTCTGT C TG CC tAtGGAGgTGCTCJMSGT^^ 3456 

ACCT GC CCG&CATTtACATaUiTACTAGAGATCTGAATTCTgGTCCTCAC^ 3516 

CTGCATTTTATCCACtAAGACAtCTCTCCAAGGGCTCCCCCTTCCTATTTAAT^ 357 6 

TTTtGAACTGGCAAGATGGCTCAGTGGGTAA(X;CA6TTTGCGGACAAACC^ 3636 

AGTtGGATCCCTGACCATAAGGTAGAAGAGACCTQATTCCTGCAAGTTGTCCTCTGM 3696 

CCACCCCATACATGCTTCTGCATATGTGCACACATCAaiTTCtTGCACACACACTGA 3756 

ACCATAAATGtAATAAAtTTTTTTAAAtAAATTQATTtTAtCTTTlAAAAAAAAAAA 3813 
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